Dataset statistics
Number of variables | 13 |
---|---|
Number of observations | 100 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 10.8 KiB |
Average record size in memory | 110.3 B |
Variable types
Numeric | 4 |
---|---|
Text | 3 |
Categorical | 6 |
Dataset
Description | Sample |
---|---|
Author | 데이터마케팅코리아 |
URL | https://www.bigdata-culture.kr/bigdata/user/data_market/detail.do?id=ddfc3c84-9a18-4f0c-8dee-985be3740462 |
Alerts
CHNNEL_CL_NM has constant value "news" | Constant |
UPPER_CTGRY_NM has constant value "생활/문화" | Constant |
LWPRT_CTGRY_NM has constant value "생활문화일반" | Constant |
SEQ_NO is highly overall correlated with NTCE_DT | High correlation |
DPI_VALUE is highly overall correlated with RSPN_CO and 2 other fields | High correlation |
RSPN_CO is highly overall correlated with DPI_VALUE and 1 other field | High correlation |
COMMENT_CO is highly overall correlated with DPI_VALUE and 1 other field | High correlation |
NTCE_DT is highly overall correlated with SEQ_NO and 1 other field | High correlation |
RECOMEND_CO is highly overall correlated with DPI_VALUE and 1 other field | High correlation |
RECOMEND_CO is highly imbalanced (84.9%) | Imbalance |
SEQ_NO has unique values | Unique |
CNTNTS_ID has unique values | Unique |
TITLE_NM has unique values | Unique |
CNTNTS_URL has unique values | Unique |
DPI_VALUE has 29 (29.0%) zeros | Zeros |
RSPN_CO has 35 (35.0%) zeros | Zeros |
COMMENT_CO has 57 (57.0%) zeros | Zeros |
Reproduction
Analysis started | 2023-12-10 09:47:59.355235 |
---|---|
Analysis finished | 2023-12-10 09:48:04.467558 |
Duration | 5.11 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
SEQ_NO
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 344195.67 |
Minimum | 339122 |
---|---|
Maximum | 349585 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 339122 |
---|---|
5-th percentile | 339336.25 |
Q1 | 341763 |
median | 344019 |
Q3 | 346799.25 |
95-th percentile | 348956.3 |
Maximum | 349585 |
Range | 10463 |
Interquartile range (IQR) | 5036.25 |
Descriptive statistics
Standard deviation | 3079.1259 |
---|---|
Coefficient of variation (CV) | 0.008945859 |
Kurtosis | -1.1608818 |
Mean | 344195.67 |
Median Absolute Deviation (MAD) | 2586 |
Skewness | -0.0034572934 |
Sum | 34419567 |
Variance | 9481016.5 |
Monotonicity | Not monotonic |
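The derived figures in the two tables above follow directly from the reported quantiles and moments. As a minimal sketch, using only numbers printed in this report (the variable names are illustrative), the range, IQR, coefficient of variation, and variance can be re-derived like this:

```python
# Values copied from the SEQ_NO tables in this report.
minimum, maximum = 339122, 349585
q1, q3 = 341763, 346799.25
mean, std = 344195.67, 3079.1259

value_range = maximum - minimum  # reported Range: 10463
iqr = q3 - q1                    # reported IQR: 5036.25
cv = std / mean                  # reported CV: 0.008945859
variance = std ** 2              # reported Variance: 9481016.5 (inputs are rounded)

print(value_range, iqr, cv, variance)
```

Because the report rounds the mean and standard deviation, the recomputed CV and variance agree with the printed ones only up to rounding.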
Value | Count | Frequency (%) |
340794 | 1 | 1.0% |
346124 | 1 | 1.0% |
346679 | 1 | 1.0% |
346836 | 1 | 1.0% |
346298 | 1 | 1.0% |
346627 | 1 | 1.0% |
346196 | 1 | 1.0% |
346482 | 1 | 1.0% |
346583 | 1 | 1.0% |
346275 | 1 | 1.0% |
Other values (90) | 90 |
Value | Count | Frequency (%) |
339122 | 1 | |
339147 | 1 | |
339177 | 1 | |
339202 | 1 | |
339303 | 1 | |
339338 | 1 | |
339397 | 1 | |
339416 | 1 | |
339447 | 1 | |
339579 | 1 |
Value | Count | Frequency (%) |
349585 | 1 | |
349477 | 1 | |
349250 | 1 | |
349128 | 1 | |
349000 | 1 | |
348954 | 1 | |
348893 | 1 | |
348774 | 1 | |
348492 | 1 | |
348486 | 1 |
CNTNTS_ID
Text
UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 32 |
---|---|
Median length | 32 |
Mean length | 32 |
Min length | 32 |
Characters and Unicode
Total characters | 3200 |
---|---|
Distinct characters | 16 |
Distinct categories | 2 |
Distinct scripts | 2 |
Distinct blocks | 1 |
Unique
Unique | 100 |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 8230ae14f43711eb9925002b67f7b0e1 |
---|---|
2nd row | 8230d529f43711eb86c9002b67f7b0e1 |
3rd row | 076dcf82565211ebacaa70c94e625020 |
4th row | fbca4109565111eb927e70c94e625020 |
5th row | ee2b558c565111ebb98870c94e625020 |
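The sample values are 32 lowercase hexadecimal characters, which is the shape of a UUID with its hyphens removed. That reading is an assumption about the data, not something the report states. A minimal sketch using Python's standard `uuid` module to check it on one sample row:

```python
import uuid

sample = "8230ae14f43711eb9925002b67f7b0e1"  # 1st row from the Sample table

# uuid.UUID accepts a 32-character hex string with or without hyphens.
parsed = uuid.UUID(hex=sample)

print(parsed)          # canonical hyphenated form
print(parsed.version)  # version field, set only for RFC 4122 style UUIDs
```

If the IDs are UUIDs, parsing them up front gives a cheap validity check before they are used as join keys.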
Value | Count | Frequency (%) |
8230ae14f43711eb9925002b67f7b0e1 | 1 | 1.0% |
82347ed3f43711eba63b002b67f7b0e1 | 1 | 1.0% |
82358fcef43711ebb1d4002b67f7b0e1 | 1 | 1.0% |
82351ad0f43711ebbecb002b67f7b0e1 | 1 | 1.0% |
82356898f43711ebb464002b67f7b0e1 | 1 | 1.0% |
8234f3eff43711eb8d5f002b67f7b0e1 | 1 | 1.0% |
8235419cf43711ebbc86002b67f7b0e1 | 1 | 1.0% |
82354201f43711eba76b002b67f7b0e1 | 1 | 1.0% |
82351ab9f43711ebb681002b67f7b0e1 | 1 | 1.0% |
823541b2f43711ebb3fb002b67f7b0e1 | 1 | 1.0% |
Other values (90) | 90 |
Most occurring characters
Value | Count | Frequency (%) |
1 | 359 | |
b | 356 | |
0 | 346 | |
7 | 322 | |
2 | 271 | |
e | 241 | |
3 | 224 | |
f | 222 | |
6 | 177 | 5.5% |
8 | 162 | 5.1% |
Other values (6) | 520 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 2192 | |
Lowercase Letter | 1008 |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
1 | 359 | |
0 | 346 | |
7 | 322 | |
2 | 271 | |
3 | 224 | |
6 | 177 | |
8 | 162 | |
4 | 162 | |
5 | 92 | 4.2% |
9 | 77 | 3.5% |
Lowercase Letter
Value | Count | Frequency (%) |
b | 356 | |
e | 241 | |
f | 222 | |
a | 78 | 7.7% |
c | 62 | 6.2% |
d | 49 | 4.9% |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 2192 | |
Latin | 1008 |
Most frequent character per script
Common
Value | Count | Frequency (%) |
1 | 359 | |
0 | 346 | |
7 | 322 | |
2 | 271 | |
3 | 224 | |
6 | 177 | |
8 | 162 | |
4 | 162 | |
5 | 92 | 4.2% |
9 | 77 | 3.5% |
Latin
Value | Count | Frequency (%) |
b | 356 | |
e | 241 | |
f | 222 | |
a | 78 | 7.7% |
c | 62 | 6.2% |
d | 49 | 4.9% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 3200 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1 | 359 | |
b | 356 | |
0 | 346 | |
7 | 322 | |
2 | 271 | |
e | 241 | |
3 | 224 | |
f | 222 | |
6 | 177 | 5.5% |
8 | 162 | 5.1% |
Other values (6) | 520 |
CHNNEL_CL_NM
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 4 |
---|---|
Median length | 4 |
Mean length | 4 |
Min length | 4 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | news |
---|---|
2nd row | news |
3rd row | news |
4th row | news |
5th row | news |
Common Values
Value | Count | Frequency (%) |
news | 100 |
CHNNEL_NM
Categorical
Distinct | 36 |
---|---|
Distinct (%) | 36.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 7 |
---|---|
Median length | 4 |
Mean length | 3.95 |
Min length | 3 |
Unique
Unique | 16 |
---|---|
Unique (%) | 16.0% |
Sample
1st row | 오마이뉴스 |
---|---|
2nd row | 조선일보 |
3rd row | 한국일보 |
4th row | 머니투데이 |
5th row | 서울경제 |
Common Values
Value | Count | Frequency (%) |
연합뉴스 | 21 | |
뉴시스 | 15 | |
동아일보 | 7 | 7.0% |
한국일보 | 5 | 5.0% |
오마이뉴스 | 3 | 3.0% |
KBS | 3 | 3.0% |
더팩트 | 3 | 3.0% |
SBS | 3 | 3.0% |
SBS Biz | 2 | 2.0% |
경향신문 | 2 | 2.0% |
Other values (26) | 36 |
Words
Value | Count | Frequency (%) |
연합뉴스 | 21 | |
뉴시스 | 15 | |
동아일보 | 7 | 6.9% |
한국일보 | 5 | 4.9% |
sbs | 5 | 4.9% |
오마이뉴스 | 3 | 2.9% |
kbs | 3 | 2.9% |
더팩트 | 3 | 2.9% |
조선일보 | 2 | 2.0% |
중앙일보 | 2 | 2.0% |
Other values (26) | 36 |
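The second table appears to count whitespace-separated, lowercased tokens rather than whole values. That would explain why `sbs` shows 5 (3 `SBS` plus 2 `SBS Biz`) and why its percentages are slightly lower than in Common Values. A minimal sketch of that tokenization, using only the SBS rows from the Common Values table as input (the exact tokenizer ydata-profiling uses is an assumption here):

```python
from collections import Counter

# Counts taken from the Common Values table above (SBS rows only).
values = ["SBS"] * 3 + ["SBS Biz"] * 2

# Lowercase each value, split on whitespace, then count the tokens.
word_counts = Counter(word for v in values for word in v.lower().split())

print(word_counts["sbs"], word_counts["biz"])
```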
TITLE_NM
Text
UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 59 |
---|---|
Median length | 35.5 |
Mean length | 28.56 |
Min length | 10 |
Characters and Unicode
Total characters | 2856 |
---|---|
Distinct characters | 538 |
Distinct categories | 13 |
Distinct scripts | 4 |
Distinct blocks | 7 |
Unique
Unique | 100 |
---|---|
Unique (%) | 100.0% |
Sample
1st row | 사람들 배불리 먹이고 싶어서 13년째 김밥 천 원에 팝니다 |
---|---|
2nd row | [2021 신춘문예] 조선일보 2021 신춘문예 당선자들 |
3rd row | [2021 한국일보 신춘문예] 동시 당선작 '검은 고양이' |
4th row | '집콕'이지만 덜 지루하게 새해를 맞이하는 법 |
5th row | 연말 ‘한탕’ 욕심에...‘숙박업 객실 제한’ 해돋이 명소에선 나몰라라 |
Value | Count | Frequency (%) |
날씨 | 4 | 0.6% |
2021 | 4 | 0.6% |
신춘문예 | 3 | 0.5% |
출시 | 3 | 0.5% |
관악구 | 2 | 0.3% |
주말 | 2 | 0.3% |
눈 | 2 | 0.3% |
열풍 | 2 | 0.3% |
사업 | 2 | 0.3% |
첫 | 2 | 0.3% |
Other values (598) | 621 |
Most occurring characters
Value | Count | Frequency (%) |
(space) | 547 | 19.2% |
' | 58 | 2.0% |
이 | 37 | 1.3% |
, | 36 | 1.3% |
… | 30 | 1.1% |
한 | 30 | 1.1% |
2 | 27 | 0.9% |
0 | 27 | 0.9% |
[ | 24 | 0.8% |
지 | 24 | 0.8% |
Other values (528) | 2016 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 1829 | |
Space Separator | 547 | 19.2% |
Other Punctuation | 168 | 5.9% |
Decimal Number | 117 | 4.1% |
Uppercase Letter | 73 | 2.6% |
Open Punctuation | 29 | 1.0% |
Close Punctuation | 29 | 1.0% |
Lowercase Letter | 22 | 0.8% |
Initial Punctuation | 16 | 0.6% |
Final Punctuation | 15 | 0.5% |
Other values (3) | 11 | 0.4% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
이 | 37 | 2.0% |
한 | 30 | 1.6% |
지 | 24 | 1.3% |
사 | 23 | 1.3% |
가 | 23 | 1.3% |
다 | 22 | 1.2% |
에 | 22 | 1.2% |
는 | 21 | 1.1% |
의 | 21 | 1.1% |
전 | 19 | 1.0% |
Other values (456) | 1587 |
Uppercase Letter
Value | Count | Frequency (%) |
T | 9 | |
S | 8 | 11.0% |
J | 5 | 6.8% |
N | 5 | 6.8% |
C | 5 | 6.8% |
M | 5 | 6.8% |
G | 5 | 6.8% |
E | 4 | 5.5% |
F | 4 | 5.5% |
P | 3 | 4.1% |
Other values (11) | 20 |
Lowercase Letter
Value | Count | Frequency (%) |
i | 3 | |
e | 3 | |
s | 2 | |
h | 2 | |
o | 2 | |
p | 2 | |
t | 2 | |
w | 1 | 4.5% |
n | 1 | 4.5% |
g | 1 | 4.5% |
Other values (3) | 3 |
Other Punctuation
Value | Count | Frequency (%) |
' | 58 | |
, | 36 | |
… | 30 | |
· | 17 | 10.1% |
. | 16 | 9.5% |
" | 5 | 3.0% |
? | 3 | 1.8% |
: | 1 | 0.6% |
! | 1 | 0.6% |
% | 1 | 0.6% |
Decimal Number
Value | Count | Frequency (%) |
2 | 27 | |
0 | 27 | |
1 | 22 | |
5 | 11 | |
3 | 10 | 8.5% |
4 | 5 | 4.3% |
9 | 5 | 4.3% |
6 | 4 | 3.4% |
7 | 3 | 2.6% |
8 | 3 | 2.6% |
Math Symbol
Value | Count | Frequency (%) |
× | 1 | |
~ | 1 | |
∙ | 1 | |
+ | 1 | |
< | 1 | |
> | 1 |
Open Punctuation
Value | Count | Frequency (%) |
[ | 24 | |
( | 5 | 17.2% |
Close Punctuation
Value | Count | Frequency (%) |
] | 24 | |
) | 5 | 17.2% |
Final Punctuation
Value | Count | Frequency (%) |
’ | 10 | |
” | 5 |
Initial Punctuation
Value | Count | Frequency (%) |
‘ | 10 | |
“ | 6 |
Other Symbol
Value | Count | Frequency (%) |
㎏ | 1 | |
㎝ | 1 |
Space Separator
Value | Count | Frequency (%) |
(space) | 547 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 3 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 1825 | |
Common | 932 | |
Latin | 95 | 3.3% |
Han | 4 | 0.1% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
이 | 37 | 2.0% |
한 | 30 | 1.6% |
지 | 24 | 1.3% |
사 | 23 | 1.3% |
가 | 23 | 1.3% |
다 | 22 | 1.2% |
에 | 22 | 1.2% |
는 | 21 | 1.2% |
의 | 21 | 1.2% |
전 | 19 | 1.0% |
Other values (452) | 1583 |
Common
Value | Count | Frequency (%) |
(space) | 547 | |
' | 58 | 6.2% |
, | 36 | 3.9% |
… | 30 | 3.2% |
2 | 27 | 2.9% |
0 | 27 | 2.9% |
[ | 24 | 2.6% |
] | 24 | 2.6% |
1 | 22 | 2.4% |
· | 17 | 1.8% |
Other values (28) | 120 | 12.9% |
Latin
Value | Count | Frequency (%) |
T | 9 | 9.5% |
S | 8 | 8.4% |
J | 5 | 5.3% |
N | 5 | 5.3% |
C | 5 | 5.3% |
M | 5 | 5.3% |
G | 5 | 5.3% |
E | 4 | 4.2% |
F | 4 | 4.2% |
P | 3 | 3.2% |
Other values (24) | 42 |
Han
Value | Count | Frequency (%) |
斷 | 1 | |
想 | 1 | |
必 | 1 | |
詩 | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 1825 | |
ASCII | 945 | |
Punctuation | 61 | 2.1% |
None | 18 | 0.6% |
CJK | 4 | 0.1% |
CJK Compat | 2 | 0.1% |
Math Operators | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 547 | |
' | 58 | 6.1% |
, | 36 | 3.8% |
2 | 27 | 2.9% |
0 | 27 | 2.9% |
[ | 24 | 2.5% |
] | 24 | 2.5% |
1 | 22 | 2.3% |
. | 16 | 1.7% |
5 | 11 | 1.2% |
Other values (52) | 153 | 16.2% |
Hangul
Value | Count | Frequency (%) |
이 | 37 | 2.0% |
한 | 30 | 1.6% |
지 | 24 | 1.3% |
사 | 23 | 1.3% |
가 | 23 | 1.3% |
다 | 22 | 1.2% |
에 | 22 | 1.2% |
는 | 21 | 1.2% |
의 | 21 | 1.2% |
전 | 19 | 1.0% |
Other values (452) | 1583 |
Punctuation
Value | Count | Frequency (%) |
… | 30 | |
’ | 10 | 16.4% |
‘ | 10 | 16.4% |
“ | 6 | 9.8% |
” | 5 | 8.2% |
None
Value | Count | Frequency (%) |
· | 17 | |
× | 1 | 5.6% |
CJK
Value | Count | Frequency (%) |
斷 | 1 | |
想 | 1 | |
必 | 1 | |
詩 | 1 |
Math Operators
Value | Count | Frequency (%) |
∙ | 1 |
CJK Compat
Value | Count | Frequency (%) |
㎏ | 1 | |
㎝ | 1 |
NTCE_DT
Categorical
HIGH CORRELATION
 
Distinct | 40 |
---|---|
Distinct (%) | 40.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 19 |
---|---|
Median length | 19 |
Mean length | 19 |
Min length | 19 |
Unique
Unique | 22 |
---|---|
Unique (%) | 22.0% |
Sample
1st row | 2021-01-01 00:00:00 |
---|---|
2nd row | 2021-01-01 00:00:00 |
3rd row | 2021-01-01 04:31:00 |
4th row | 2021-01-01 07:00:00 |
5th row | 2021-01-01 10:01:00 |
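NTCE_DT is profiled as Categorical, yet every sample value has the fixed 19-character form `YYYY-MM-DD HH:MM:SS`. A minimal sketch, assuming that format holds for all rows, of parsing one value into a `datetime` before any time-based analysis:

```python
from datetime import datetime

raw = "2021-01-01 04:31:00"  # 3rd row from the Sample table

# The format string matches the fixed-width layout seen in the samples.
parsed = datetime.strptime(raw, "%Y-%m-%d %H:%M:%S")

print(parsed.date(), parsed.hour, parsed.minute)
```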
Common Values
Value | Count | Frequency (%) |
2021-01-21 00:00:00 | 8 | 8.0% |
2021-01-11 00:00:00 | 7 | 7.0% |
2021-01-07 00:00:00 | 6 | 6.0% |
2021-01-28 00:00:00 | 6 | 6.0% |
2021-01-14 00:00:00 | 6 | 6.0% |
2021-01-26 00:00:00 | 5 | 5.0% |
2021-01-22 00:00:00 | 5 | 5.0% |
2021-01-04 00:00:00 | 5 | 5.0% |
2021-01-06 00:00:00 | 4 | 4.0% |
2021-01-27 00:00:00 | 4 | 4.0% |
Other values (30) | 44 |
Words
Value | Count | Frequency (%) |
00:00:00 | 86 | |
2021-01-04 | 9 | 4.5% |
2021-01-21 | 8 | 4.0% |
2021-01-11 | 7 | 3.5% |
2021-01-07 | 6 | 3.0% |
2021-01-28 | 6 | 3.0% |
2021-01-14 | 6 | 3.0% |
2021-01-06 | 6 | 3.0% |
2021-01-01 | 6 | 3.0% |
2021-01-26 | 5 | 2.5% |
Other values (31) | 55 |
DPI_VALUE
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 33 |
---|---|
Distinct (%) | 33.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 13.602 |
Minimum | 0 |
---|---|
Maximum | 311.2 |
Zeros | 29 |
Zeros (%) | 29.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0.4 |
Q3 | 2.95 |
95-th percentile | 70.55 |
Maximum | 311.2 |
Range | 311.2 |
Interquartile range (IQR) | 2.95 |
Descriptive statistics
Standard deviation | 48.27171 |
---|---|
Coefficient of variation (CV) | 3.5488685 |
Kurtosis | 25.214569 |
Mean | 13.602 |
Median Absolute Deviation (MAD) | 0.4 |
Skewness | 4.8983597 |
Sum | 1360.2 |
Variance | 2330.158 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.0 | 29 | |
0.2 | 13 | |
0.4 | 13 | |
1.0 | 7 | 7.0% |
0.6 | 5 | 5.0% |
4.6 | 3 | 3.0% |
14.6 | 2 | 2.0% |
0.8 | 2 | 2.0% |
1.4 | 2 | 2.0% |
165.8 | 1 | 1.0% |
Other values (23) | 23 |
Value | Count | Frequency (%) |
0.0 | 29 | |
0.2 | 13 | |
0.4 | 13 | |
0.6 | 5 | 5.0% |
0.8 | 2 | 2.0% |
1.0 | 7 | 7.0% |
1.2 | 1 | 1.0% |
1.4 | 2 | 2.0% |
1.8 | 1 | 1.0% |
2.4 | 1 | 1.0% |
Value | Count | Frequency (%) |
311.2 | 1 | |
285.6 | 1 | |
165.8 | 1 | |
155.8 | 1 | |
96.2 | 1 | |
69.2 | 1 | |
37.2 | 1 | |
27.0 | 1 | |
26.2 | 1 | |
23.8 | 1 |
RSPN_CO
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 31 |
---|---|
Distinct (%) | 31.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 40.53 |
Minimum | 0 |
---|---|
Maximum | 948 |
Zeros | 35 |
Zeros (%) | 35.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 1 |
Q3 | 7 |
95-th percentile | 133.3 |
Maximum | 948 |
Range | 948 |
Interquartile range (IQR) | 7 |
Descriptive statistics
Standard deviation | 152.80029 |
---|---|
Coefficient of variation (CV) | 3.770054 |
Kurtosis | 25.925864 |
Mean | 40.53 |
Median Absolute Deviation (MAD) | 1 |
Skewness | 5.0870998 |
Sum | 4053 |
Variance | 23347.928 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 35 | |
1 | 19 | |
2 | 7 | 7.0% |
3 | 6 | 6.0% |
4 | 4 | 4.0% |
5 | 2 | 2.0% |
11 | 2 | 2.0% |
7 | 2 | 2.0% |
51 | 1 | 1.0% |
16 | 1 | 1.0% |
Other values (21) | 21 |
Value | Count | Frequency (%) |
0 | 35 | |
1 | 19 | |
2 | 7 | 7.0% |
3 | 6 | 6.0% |
4 | 4 | 4.0% |
5 | 2 | 2.0% |
6 | 1 | 1.0% |
7 | 2 | 2.0% |
8 | 1 | 1.0% |
11 | 2 | 2.0% |
Value | Count | Frequency (%) |
948 | 1 | |
838 | 1 | |
829 | 1 | |
295 | 1 | |
177 | 1 | |
131 | 1 | |
130 | 1 | |
105 | 1 | |
96 | 1 | |
53 | 1 |
COMMENT_CO
Real number (ℝ)
HIGH CORRELATION
  ZEROS
 
Distinct | 21 |
---|---|
Distinct (%) | 21.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 13.52 |
Minimum | 0 |
---|---|
Maximum | 359 |
Zeros | 57 |
Zeros (%) | 57.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.0 KiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 0 |
Q1 | 0 |
median | 0 |
Q3 | 2 |
95-th percentile | 37.7 |
Maximum | 359 |
Range | 359 |
Interquartile range (IQR) | 2 |
Descriptive statistics
Standard deviation | 51.863745 |
---|---|
Coefficient of variation (CV) | 3.8360758 |
Kurtosis | 26.650803 |
Mean | 13.52 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 5.0326084 |
Sum | 1352 |
Variance | 2689.8481 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0 | 57 | |
1 | 14 | 14.0% |
2 | 7 | 7.0% |
8 | 2 | 2.0% |
7 | 2 | 2.0% |
5 | 2 | 2.0% |
3 | 2 | 2.0% |
6 | 1 | 1.0% |
29 | 1 | 1.0% |
34 | 1 | 1.0% |
Other values (11) | 11 | 11.0% |
Value | Count | Frequency (%) |
0 | 57 | |
1 | 14 | 14.0% |
2 | 7 | 7.0% |
3 | 2 | 2.0% |
5 | 2 | 2.0% |
6 | 1 | 1.0% |
7 | 2 | 2.0% |
8 | 2 | 2.0% |
9 | 1 | 1.0% |
10 | 1 | 1.0% |
Value | Count | Frequency (%) |
359 | 1 | |
242 | 1 | |
240 | 1 | |
152 | 1 | |
108 | 1 | |
34 | 1 | |
32 | 1 | |
29 | 1 | |
27 | 1 | |
17 | 1 |
RECOMEND_CO
Categorical
HIGH CORRELATION
  IMBALANCE
 
Distinct | 4 |
---|---|
Distinct (%) | 4.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 2 |
---|---|
Median length | 1 |
Mean length | 1.01 |
Min length | 1 |
Unique
Unique | 2 |
---|---|
Unique (%) | 2.0% |
Sample
1st row | 0 |
---|---|
2nd row | 0 |
3rd row | 0 |
4th row | 0 |
5th row | 5 |
Common Values
Value | Count | Frequency (%) |
0 | 96 | |
2 | 2 | 2.0% |
5 | 1 | 1.0% |
13 | 1 | 1.0% |
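The 84.9% imbalance alert for RECOMEND_CO can be reproduced from the counts above if imbalance is taken as one minus the normalized Shannon entropy of the class distribution. That definition is an assumption here: it matches the reported figure, but the report itself does not state the formula.

```python
import math

counts = [96, 2, 1, 1]  # Common Values counts for RECOMEND_CO
total = sum(counts)

# Shannon entropy of the class distribution (natural log).
entropy = -sum((c / total) * math.log(c / total) for c in counts)

# Normalize by the maximum entropy for this many classes, then invert.
imbalance = 1 - entropy / math.log(len(counts))

print(f"{imbalance:.1%}")
```

A score near 100% means almost all rows fall into a single class, which is the case here with 96 of 100 rows equal to 0.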
CNTNTS_URL
Text
UNIQUE
 
Distinct | 100 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 97 |
---|---|
Median length | 97 |
Mean length | 96.72 |
Min length | 95 |
Characters and Unicode
Total characters | 9672 |
---|---|
Distinct characters | 34 |
Distinct categories | 5 |
Distinct scripts | 2 |
Distinct blocks | 1 |
Unique
Unique | 100 |
---|---|
Unique (%) | 100.0% |
Sample
1st row | https://news.naver.com/main/read.naver?mode=LS2D&mid=shm&sid1=103&sid2=245&oid=047&aid=0002297434 |
---|---|
2nd row | https://news.naver.com/main/read.naver?mode=LS2D&mid=shm&sid1=103&sid2=245&oid=023&aid=0003587130 |
3rd row | https://news.naver.com/main/read.nhn?mode=LS2D&mid=shm&sid1=103&sid2=245&oid=469&aid=0000567922 |
4th row | https://news.naver.com/main/read.nhn?mode=LS2D&mid=shm&sid1=103&sid2=245&oid=008&aid=0004522315 |
5th row | https://news.naver.com/main/read.nhn?mode=LS2D&mid=shm&sid1=103&sid2=245&oid=011&aid=0003850216 |
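All five sample URLs share the same host and the same `mode`, `mid`, `sid1`, and `sid2` parameters. Only the path suffix (`read.naver` vs `read.nhn`) and the `oid` and `aid` parameters change. A minimal sketch of pulling those two parameters out with the standard library; what `oid` and `aid` represent is not stated in the report, so they are treated as opaque strings:

```python
from urllib.parse import urlparse, parse_qs

# 1st row from the Sample table.
url = ("https://news.naver.com/main/read.naver"
       "?mode=LS2D&mid=shm&sid1=103&sid2=245&oid=047&aid=0002297434")

# parse_qs returns a dict mapping each parameter name to a list of values.
params = parse_qs(urlparse(url).query)

oid = params["oid"][0]  # kept as a string so leading zeros survive
aid = params["aid"][0]
print(oid, aid)
```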
Value | Count | Frequency (%) |
https://news.naver.com/main/read.naver?mode=ls2d&mid=shm&sid1=103&sid2=245&oid=047&aid=0002297434 | 1 | 1.0% |
https://news.naver.com/main/read.naver?mode=ls2d&mid=shm&sid1=103&sid2=245&oid=020&aid=0003332852 | 1 | 1.0% |
https://news.naver.com/main/read.naver?mode=ls2d&mid=shm&sid1=103&sid2=245&oid=003&aid=0010308075 | 1 | 1.0% |
https://news.naver.com/main/read.naver?mode=ls2d&mid=shm&sid1=103&sid2=245&oid=025&aid=0003071594 | 1 | 1.0% |
https://news.naver.com/main/read.naver?mode=ls2d&mid=shm&sid1=103&sid2=245&oid=020&aid=0003333743 | 1 | 1.0% |
https://news.naver.com/main/read.naver?mode=ls2d&mid=shm&sid1=103&sid2=245&oid=003&aid=0010307067 | 1 | 1.0% |
https://news.naver.com/main/read.naver?mode=ls2d&mid=shm&sid1=103&sid2=245&oid=047&aid=0002299614 | 1 | 1.0% |
https://news.naver.com/main/read.naver?mode=ls2d&mid=shm&sid1=103&sid2=245&oid=001&aid=0012153297 | 1 | 1.0% |
https://news.naver.com/main/read.naver?mode=ls2d&mid=shm&sid1=103&sid2=245&oid=032&aid=0003055227 | 1 | 1.0% |
https://news.naver.com/main/read.naver?mode=ls2d&mid=shm&sid1=103&sid2=245&oid=023&aid=0003591176 | 1 | 1.0% |
Other values (90) | 90 |
Most occurring characters
Value | Count | Frequency (%) |
d | 700 | 7.2% |
= | 600 | 6.2% |
i | 600 | 6.2% |
0 | 585 | 6.0% |
s | 500 | 5.2% |
& | 500 | 5.2% |
m | 500 | 5.2% |
e | 486 | 5.0% |
a | 486 | 5.0% |
2 | 430 | 4.4% |
Other values (24) | 4285 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 5172 | |
Decimal Number | 2200 | |
Other Punctuation | 1400 | 14.5% |
Math Symbol | 600 | 6.2% |
Uppercase Letter | 300 | 3.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
d | 700 | |
i | 600 | |
s | 500 | |
m | 500 | |
e | 486 | |
a | 486 | |
n | 414 | |
o | 300 | |
r | 286 | |
h | 214 | 4.1% |
Other values (5) | 686 |
Decimal Number
Value | Count | Frequency (%) |
0 | 585 | |
2 | 430 | |
1 | 347 | |
3 | 220 | 10.0% |
5 | 192 | 8.7% |
4 | 165 | 7.5% |
7 | 72 | 3.3% |
6 | 69 | 3.1% |
8 | 64 | 2.9% |
9 | 56 | 2.5% |
Other Punctuation
Value | Count | Frequency (%) |
& | 500 | |
/ | 400 | |
. | 300 | |
? | 100 | 7.1% |
: | 100 | 7.1% |
Uppercase Letter
Value | Count | Frequency (%) |
L | 100 | |
S | 100 | |
D | 100 |
Math Symbol
Value | Count | Frequency (%) |
= | 600 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 5472 | |
Common | 4200 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
d | 700 | |
i | 600 | |
s | 500 | |
m | 500 | |
e | 486 | |
a | 486 | |
n | 414 | |
o | 300 | 5.5% |
r | 286 | 5.2% |
h | 214 | 3.9% |
Other values (8) | 986 |
Common
Value | Count | Frequency (%) |
= | 600 | |
0 | 585 | |
& | 500 | |
2 | 430 | |
/ | 400 | |
1 | 347 | |
. | 300 | |
3 | 220 | 5.2% |
5 | 192 | 4.6% |
4 | 165 | 3.9% |
Other values (6) | 461 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 9672 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
d | 700 | 7.2% |
= | 600 | 6.2% |
i | 600 | 6.2% |
0 | 585 | 6.0% |
s | 500 | 5.2% |
& | 500 | 5.2% |
m | 500 | 5.2% |
e | 486 | 5.0% |
a | 486 | 5.0% |
2 | 430 | 4.4% |
Other values (24) | 4285 |
UPPER_CTGRY_NM
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 5 |
---|---|
Median length | 5 |
Mean length | 5 |
Min length | 5 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 생활/문화 |
---|---|
2nd row | 생활/문화 |
3rd row | 생활/문화 |
4th row | 생활/문화 |
5th row | 생활/문화 |
Common Values
Value | Count | Frequency (%) |
생활/문화 | 100 |
LWPRT_CTGRY_NM
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 932.0 B |
Length
Max length | 6 |
---|---|
Median length | 6 |
Mean length | 6 |
Min length | 6 |
Unique
Unique | 0 |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 생활문화일반 |
---|---|
2nd row | 생활문화일반 |
3rd row | 생활문화일반 |
4th row | 생활문화일반 |
5th row | 생활문화일반 |
Common Values
Value | Count | Frequency (%) |
생활문화일반 | 100 |
Variable | SEQ_NO | CNTNTS_ID | CHNNEL_NM | TITLE_NM | NTCE_DT | DPI_VALUE | RSPN_CO | COMMENT_CO | RECOMEND_CO | CNTNTS_URL |
---|---|---|---|---|---|---|---|---|---|---|
SEQ_NO | 1.000 | 1.000 | 0.479 | 1.000 | 0.994 | 0.460 | 0.445 | 0.534 | 0.077 | 1.000 |
CNTNTS_ID | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
CHNNEL_NM | 0.479 | 1.000 | 1.000 | 1.000 | 0.832 | 0.662 | 0.000 | 0.000 | 0.788 | 1.000 |
TITLE_NM | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
NTCE_DT | 0.994 | 1.000 | 0.832 | 1.000 | 1.000 | 0.803 | 0.362 | 0.614 | 1.000 | 1.000 |
DPI_VALUE | 0.460 | 1.000 | 0.662 | 1.000 | 0.803 | 1.000 | 0.812 | 0.908 | 0.714 | 1.000 |
RSPN_CO | 0.445 | 1.000 | 0.000 | 1.000 | 0.362 | 0.812 | 1.000 | 0.945 | 0.196 | 1.000 |
COMMENT_CO | 0.534 | 1.000 | 0.000 | 1.000 | 0.614 | 0.908 | 0.945 | 1.000 | 0.000 | 1.000 |
RECOMEND_CO | 0.077 | 1.000 | 0.788 | 1.000 | 1.000 | 0.714 | 0.196 | 0.000 | 1.000 | 1.000 |
CNTNTS_URL | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
Variable | NTCE_DT | RECOMEND_CO | CHNNEL_NM |
---|---|---|---|
NTCE_DT | 1.000 | 0.791 | 0.274 |
RECOMEND_CO | 0.791 | 1.000 | 0.403 |
CHNNEL_NM | 0.274 | 0.403 | 1.000 |
Variable | SEQ_NO | DPI_VALUE | RSPN_CO | COMMENT_CO | CHNNEL_NM | NTCE_DT | RECOMEND_CO |
---|---|---|---|---|---|---|---|
SEQ_NO | 1.000 | -0.096 | -0.051 | -0.126 | 0.135 | 0.767 | 0.000 |
DPI_VALUE | -0.096 | 1.000 | 0.935 | 0.839 | 0.269 | 0.400 | 0.541 |
RSPN_CO | -0.051 | 0.935 | 1.000 | 0.679 | 0.000 | 0.115 | 0.159 |
COMMENT_CO | -0.126 | 0.839 | 0.679 | 1.000 | 0.000 | 0.244 | 0.000 |
CHNNEL_NM | 0.135 | 0.269 | 0.000 | 0.000 | 1.000 | 0.274 | 0.403 |
NTCE_DT | 0.767 | 0.400 | 0.115 | 0.244 | 0.274 | 1.000 | 0.791 |
RECOMEND_CO | 0.000 | 0.541 | 0.159 | 0.000 | 0.403 | 0.791 | 1.000 |
First rows
Index | SEQ_NO | CNTNTS_ID | CHNNEL_CL_NM | CHNNEL_NM | TITLE_NM | NTCE_DT | DPI_VALUE | RSPN_CO | COMMENT_CO | RECOMEND_CO | CNTNTS_URL | UPPER_CTGRY_NM | LWPRT_CTGRY_NM |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 340794 | 8230ae14f43711eb9925002b67f7b0e1 | news | 오마이뉴스 | 사람들 배불리 먹이고 싶어서 13년째 김밥 천 원에 팝니다 | 2021-01-01 00:00:00 | 285.6 | 948 | 240 | 0 | https://news.naver.com/main/read.naver?mode=LS2D&mid=shm&sid1=103&sid2=245&oid=047&aid=0002297434 | 생활/문화 | 생활문화일반 |
1 | 341011 | 8230d529f43711eb86c9002b67f7b0e1 | news | 조선일보 | [2021 신춘문예] 조선일보 2021 신춘문예 당선자들 | 2021-01-01 00:00:00 | 2.4 | 6 | 3 | 0 | https://news.naver.com/main/read.naver?mode=LS2D&mid=shm&sid1=103&sid2=245&oid=023&aid=0003587130 | 생활/문화 | 생활문화일반 |
2 | 339202 | 076dcf82565211ebacaa70c94e625020 | news | 한국일보 | [2021 한국일보 신춘문예] 동시 당선작 '검은 고양이' | 2021-01-01 04:31:00 | 1.0 | 1 | 2 | 0 | https://news.naver.com/main/read.nhn?mode=LS2D&mid=shm&sid1=103&sid2=245&oid=469&aid=0000567922 | 생활/문화 | 생활문화일반 |
3 | 339177 | fbca4109565111eb927e70c94e625020 | news | 머니투데이 | '집콕'이지만 덜 지루하게 새해를 맞이하는 법 | 2021-01-01 07:00:00 | 0.2 | 1 | 0 | 0 | https://news.naver.com/main/read.nhn?mode=LS2D&mid=shm&sid1=103&sid2=245&oid=008&aid=0004522315 | 생활/문화 | 생활문화일반 |
4 | 339147 | ee2b558c565111ebb98870c94e625020 | news | 서울경제 | 연말 ‘한탕’ 욕심에...‘숙박업 객실 제한’ 해돋이 명소에선 나몰라라 | 2021-01-01 10:01:00 | 19.6 | 34 | 27 | 5 | https://news.naver.com/main/read.nhn?mode=LS2D&mid=shm&sid1=103&sid2=245&oid=011&aid=0003850216 | 생활/문화 | 생활문화일반 |
5 | 339122 | e2cbbf47565111ebbcf070c94e625020 | news | 프레시안 | [신년 詩] 사랑의 타종 | 2021-01-01 12:56:00 | 0.4 | 2 | 0 | 0 | https://news.naver.com/main/read.nhn?mode=LS2D&mid=shm&sid1=103&sid2=245&oid=002&aid=0002166460 | 생활/문화 | 생활문화일반 |
6 | 341099 | 8230fbc2f43711ebb73d002b67f7b0e1 | news | 파이낸셜뉴스 | SM타운 라이브, 186개국·3583만 스트리밍..전세계 K팝 점령 | 2021-01-02 00:00:00 | 26.2 | 105 | 13 | 0 | https://news.naver.com/main/read.naver?mode=LS2D&mid=shm&sid1=103&sid2=245&oid=014&aid=0004557383 | 생활/문화 | 생활문화일반 |
7 | 339303 | 36739a4b565211eba38170c94e625020 | news | 조선비즈 | [김지수의 인터스텔라] “오래 버텼다, 잘 섞었다, 이날치가 되었다" 장영규 | 2021-01-02 07:01:00 | 37.2 | 96 | 32 | 13 | https://news.naver.com/main/read.nhn?mode=LS2D&mid=shm&sid1=103&sid2=245&oid=366&aid=0000644816 | 생활/문화 | 생활문화일반 |
8 | 341209 | 8230fc30f43711ebbd45002b67f7b0e1 | news | 연합뉴스 | 삼성전자 '비스포크 인덕션' 출시 | 2021-01-03 00:00:00 | 0.2 | 1 | 0 | 0 | https://news.naver.com/main/read.naver?mode=LS2D&mid=shm&sid1=103&sid2=245&oid=001&aid=0012116003 | 생활/문화 | 생활문화일반 |
9 | 341116 | 8230fbd3f43711eb8034002b67f7b0e1 | news | YTN | 애증의 플라스틱에 예술을 담으면? | 2021-01-03 00:00:00 | 5.2 | 8 | 9 | 0 | https://news.naver.com/main/read.naver?mode=LS2D&mid=shm&sid1=103&sid2=245&oid=052&aid=0001533701 | 생활/문화 | 생활문화일반 |
Last rows
Index | SEQ_NO | CNTNTS_ID | CHNNEL_CL_NM | CHNNEL_NM | TITLE_NM | NTCE_DT | DPI_VALUE | RSPN_CO | COMMENT_CO | RECOMEND_CO | CNTNTS_URL | UPPER_CTGRY_NM | LWPRT_CTGRY_NM |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
90 | 348084 | 8236a11af43711ebb42b002b67f7b0e1 | news | 연합뉴스 | 축사하는 전해철 행정안전부 장관 | 2021-01-27 00:00:00 | 0.0 | 0 | 0 | 0 | https://news.naver.com/main/read.naver?mode=LS2D&mid=shm&sid1=103&sid2=245&oid=001&aid=0012166832 | 생활/문화 | 생활문화일반 |
91 | 348774 | 82371692f43711ebb76c002b67f7b0e1 | news | 뉴시스 | 국립민속박물관, '한국생업기술사전: 농업 편' 발간 | 2021-01-28 00:00:00 | 0.0 | 0 | 0 | 0 | https://news.naver.com/main/read.naver?mode=LS2D&mid=shm&sid1=103&sid2=245&oid=003&aid=0010318453 | 생활/문화 | 생활문화일반 |
92 | 349128 | 82376488f43711ebbc52002b67f7b0e1 | news | KBS | [개봉영화] 명품 연기가 빚어낸 가족의 민낯 ‘세자매’ 외 | 2021-01-28 00:00:00 | 1.4 | 5 | 1 | 0 | https://news.naver.com/main/read.naver?mode=LS2D&mid=shm&sid1=103&sid2=245&oid=056&aid=0010979287 | 생활/문화 | 생활문화일반 |
93 | 348335 | 8236c85ef43711eb824d002b67f7b0e1 | news | 뉴시스 | 예산 '예당호 출렁다리·황새공원' 한국관광 100선에 | 2021-01-28 00:00:00 | 0.0 | 0 | 0 | 0 | https://news.naver.com/main/read.naver?mode=LS2D&mid=shm&sid1=103&sid2=245&oid=003&aid=0010317940 | 생활/문화 | 생활문화일반 |
94 | 349000 | 82373db5f43711eba191002b67f7b0e1 | news | 한국경제TV | 영화관도 필(必)환경…'폐스크린 가방' 만든 CGV | 2021-01-28 00:00:00 | 0.0 | 0 | 0 | 0 | https://news.naver.com/main/read.naver?mode=LS2D&mid=shm&sid1=103&sid2=245&oid=215&aid=0000933089 | 생활/문화 | 생활문화일반 |
95 | 348954 | 82373d87f43711ebbf4b002b67f7b0e1 | news | 뉴시스 | '2021~2022 한국관광 100선'…'5회연속 선정' 19개소는 어디? | 2021-01-28 00:00:00 | 0.6 | 1 | 1 | 0 | https://news.naver.com/main/read.naver?mode=LS2D&mid=shm&sid1=103&sid2=245&oid=003&aid=0010317502 | 생활/문화 | 생활문화일반 |
96 | 348893 | 82373d4af43711eb8c80002b67f7b0e1 | news | 뉴시스 | 한국공연프로듀서협회, '코로나19 장기화' 따른 비상행동 | 2021-01-28 00:00:00 | 0.6 | 3 | 0 | 0 | https://news.naver.com/main/read.naver?mode=LS2D&mid=shm&sid1=103&sid2=245&oid=003&aid=0010317867 | 생활/문화 | 생활문화일반 |
97 | 349250 | 82376502f43711ebaad3002b67f7b0e1 | news | KBS | 배우 김영철, 아너 소사이어티 2천500번째 회원 | 2021-01-29 00:00:00 | 1.0 | 3 | 1 | 0 | https://news.naver.com/main/read.naver?mode=LS2D&mid=shm&sid1=103&sid2=245&oid=056&aid=0010980542 | 생활/문화 | 생활문화일반 |
98 | 349477 | 8237b272f43711eb8f21002b67f7b0e1 | news | 여성신문 | 중부지방 눈∙강추위…낮 부터 기온 올라 | 2021-01-30 00:00:00 | 0.2 | 1 | 0 | 0 | https://news.naver.com/main/read.naver?mode=LS2D&mid=shm&sid1=103&sid2=245&oid=310&aid=0000083782 | 생활/문화 | 생활문화일반 |
99 | 349585 | 8237b2def43711ebada0002b67f7b0e1 | news | 한국일보 | 편의점에서 '식사 구독' 통했네… GS25, 유료 회원 5배 급증 | 2021-01-31 00:00:00 | 3.4 | 7 | 5 | 0 | https://news.naver.com/main/read.naver?mode=LS2D&mid=shm&sid1=103&sid2=245&oid=469&aid=0000576445 | 생활/문화 | 생활문화일반 |