Dataset statistics
Number of variables | 9 |
---|---|
Number of observations | 124 |
Missing cells | 959 |
Missing cells (%) | 85.9% |
Duplicate rows | 4 |
Duplicate rows (%) | 3.2% |
Total size in memory | 9.5 KiB |
Average record size in memory | 78.1 B |
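The percentages in this table follow directly from the counts: 124 observations across 9 variables give 1,116 cells, of which 959 are empty. A minimal sketch of that arithmetic, with illustrative variable names that are not part of the report:

```python
# Counts copied from the "Dataset statistics" table above.
n_variables = 9
n_observations = 124
missing_cells = 959
duplicate_rows = 4

total_cells = n_variables * n_observations            # every cell in the table
missing_pct = 100 * missing_cells / total_cells        # share of empty cells
duplicate_pct = 100 * duplicate_rows / n_observations  # share of duplicate rows

print(f"Missing cells (%): {missing_pct:.1f}")
print(f"Duplicate rows (%): {duplicate_pct:.1f}")
```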
Variable types
Text | 4 |
---|---|
Numeric | 4 |
Unsupported | 1 |
Dataset
Description | Sample data |
---|---|
Author | MBN |
URL | https://kdx.kr/data/view/29824 |
Dataset has 4 (3.2%) duplicate rows | Duplicates |
play_sec is highly overall correlated with play_hour and 1 other field | High correlation |
play_hour is highly overall correlated with play_sec and 1 other field | High correlation |
file_size is highly overall correlated with play_sec and 1 other field | High correlation |
vod_seq_no has 37 (29.8%) missing values | Missing |
bcast_seq_no has 114 (91.9%) missing values | Missing |
play_sec has 114 (91.9%) missing values | Missing |
play_hour has 114 (91.9%) missing values | Missing |
file_size has 114 (91.9%) missing values | Missing |
vod_path has 114 (91.9%) missing values | Missing |
title has 114 (91.9%) missing values | Missing |
contents has 114 (91.9%) missing values | Missing |
Unnamed: 8 has 124 (100.0%) missing values | Missing |
Unnamed: 8 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Reproduction
Analysis started | 2024-03-11 03:30:33.922587 |
---|---|
Analysis finished | 2024-03-11 03:30:37.290159 |
Duration | 3.37 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
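A report of this kind could plausibly be regenerated along the following lines. This is only a sketch: the CSV path, the output file name, and the report title are hypothetical, and the actual settings live in the config.json referenced above, which is not reproduced here.

```python
def build_report(csv_path: str, html_path: str = "report.html") -> None:
    """Profile a CSV file and write an HTML report (sketch, not the original run)."""
    # Imported lazily so the module still loads where these packages are absent.
    import pandas as pd
    from ydata_profiling import ProfileReport

    df = pd.read_csv(csv_path)
    profile = ProfileReport(
        df,
        title="Dataset profile",  # hypothetical title
        dataset={
            # Metadata values taken from the "Dataset" table above.
            "description": "Sample data",
            "url": "https://kdx.kr/data/view/29824",
        },
    )
    profile.to_file(html_path)
```

Calling `build_report("sample.csv")` with a local copy of the data (the file name is an assumption) would write `report.html` to the working directory.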
vod_seq_no
Text
MISSING
 
Distinct | 81 |
---|---|
Distinct (%) | 93.1% |
Missing | 37 |
Missing (%) | 29.8% |
Memory size | 1.1 KiB |
Length
Max length | 93 |
---|---|
Median length | 58 |
Mean length | 32.678161 |
Min length | 6 |
Characters and Unicode
Total characters | 2843 |
---|---|
Distinct characters | 359 |
Distinct categories | 12 |
Distinct scripts | 3 |
Distinct blocks | 4 |
Unique
Unique | 78 |
---|---|
Unique (%) | 89.7% |
Sample
1st row | 557892 |
---|---|
2nd row | 뉴스를 파헤치고 이슈를 터트리는 뉴스, |
3rd row | 뉴스 파이터.. |
4th row | 진행을 맡은 최중락입니다. |
5th row | 저와 함께 뉴스를 철저하게 해부해주실 분, 소개합니다. |
Value | Count | Frequency (%) |
박 | 12 | 1.8% |
눈이 | 8 | 1.2% |
조사를 | 7 | 1.1% |
뉴스 | 7 | 1.1% |
또 | 7 | 1.1% |
【 | 7 | 1.1% |
것으로 | 7 | 1.1% |
】 | 7 | 1.1% |
보입니다 | 6 | 0.9% |
소환 | 6 | 0.9% |
Other values (458) | 584 |
Most occurring characters
Value | Count | Frequency (%) |
(space) | 644 | 22.7% |
이 | 71 | 2.5% |
. | 60 | 2.1% |
다 | 57 | 2.0% |
니 | 46 | 1.6% |
에 | 40 | 1.4% |
서 | 35 | 1.2% |
는 | 33 | 1.2% |
은 | 31 | 1.1% |
지 | 30 | 1.1% |
Other values (349) | 1796 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 1965 | 69.1% |
Space Separator | 644 | 22.7% |
Other Punctuation | 92 | 3.2% |
Decimal Number | 77 | 2.7% |
Control | 19 | 0.7% |
Math Symbol | 11 | 0.4% |
Open Punctuation | 10 | 0.4% |
Close Punctuation | 10 | 0.4% |
Lowercase Letter | 6 | 0.2% |
Uppercase Letter | 5 | 0.2% |
Other values (2) | 4 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
이 | 71 | 3.6% |
다 | 57 | 2.9% |
니 | 46 | 2.3% |
에 | 40 | 2.0% |
서 | 35 | 1.8% |
는 | 33 | 1.7% |
은 | 31 | 1.6% |
지 | 30 | 1.5% |
사 | 29 | 1.5% |
로 | 27 | 1.4% |
Other values (317) | 1566 |
Decimal Number
Value | Count | Frequency (%) |
5 | 23 | |
8 | 15 | |
2 | 12 | |
1 | 9 | 11.7% |
4 | 5 | 6.5% |
0 | 4 | 5.2% |
3 | 4 | 5.2% |
7 | 2 | 2.6% |
9 | 2 | 2.6% |
6 | 1 | 1.3% |
Other Punctuation
Value | Count | Frequency (%) |
. | 60 | |
, | 21 | 22.8% |
? | 5 | 5.4% |
" | 4 | 4.3% |
! | 2 | 2.2% |
Uppercase Letter
Value | Count | Frequency (%) |
O | 2 | |
B | 1 | |
M | 1 | |
N | 1 |
Math Symbol
Value | Count | Frequency (%) |
> | 6 | |
< | 4 | |
~ | 1 | 9.1% |
Open Punctuation
Value | Count | Frequency (%) |
【 | 7 | |
( | 3 |
Close Punctuation
Value | Count | Frequency (%) |
】 | 7 | |
) | 3 |
Lowercase Letter
Value | Count | Frequency (%) |
m | 3 | |
c | 3 |
Space Separator
Value | Count | Frequency (%) |
(space) | 644 | 100.0% |
Control
Value | Count | Frequency (%) |
(control character) | 19 | 100.0% |
Initial Punctuation
Value | Count | Frequency (%) |
‘ | 2 |
Final Punctuation
Value | Count | Frequency (%) |
’ | 2 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 1965 | 69.1% |
Common | 867 | 30.5% |
Latin | 11 | 0.4% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
이 | 71 | 3.6% |
다 | 57 | 2.9% |
니 | 46 | 2.3% |
에 | 40 | 2.0% |
서 | 35 | 1.8% |
는 | 33 | 1.7% |
은 | 31 | 1.6% |
지 | 30 | 1.5% |
사 | 29 | 1.5% |
로 | 27 | 1.4% |
Other values (317) | 1566 |
Common
Value | Count | Frequency (%) |
(space) | 644 | 74.3% |
. | 60 | 6.9% |
5 | 23 | 2.7% |
, | 21 | 2.4% |
(control character) | 19 | 2.2% |
8 | 15 | 1.7% |
2 | 12 | 1.4% |
1 | 9 | 1.0% |
【 | 7 | 0.8% |
】 | 7 | 0.8% |
Other values (16) | 50 | 5.8% |
Latin
Value | Count | Frequency (%) |
m | 3 | |
c | 3 | |
O | 2 | |
B | 1 | 9.1% |
M | 1 | 9.1% |
N | 1 | 9.1% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 1965 | 69.1% |
ASCII | 860 | 30.2% |
None | 14 | 0.5% |
Punctuation | 4 | 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 644 | 74.9% |
. | 60 | 7.0% |
5 | 23 | 2.7% |
, | 21 | 2.4% |
(control character) | 19 | 2.2% |
8 | 15 | 1.7% |
2 | 12 | 1.4% |
1 | 9 | 1.0% |
> | 6 | 0.7% |
? | 5 | 0.6% |
Other values (18) | 46 | 5.3% |
Hangul
Value | Count | Frequency (%) |
이 | 71 | 3.6% |
다 | 57 | 2.9% |
니 | 46 | 2.3% |
에 | 40 | 2.0% |
서 | 35 | 1.8% |
는 | 33 | 1.7% |
은 | 31 | 1.6% |
지 | 30 | 1.5% |
사 | 29 | 1.5% |
로 | 27 | 1.4% |
Other values (317) | 1566 |
None
Value | Count | Frequency (%) |
【 | 7 | |
】 | 7 |
Punctuation
Value | Count | Frequency (%) |
‘ | 2 | |
’ | 2 |
bcast_seq_no
Real number (ℝ)
MISSING
 
Distinct | 10 |
---|---|
Distinct (%) | 100.0% |
Missing | 114 |
Missing (%) | 91.9% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1085455.9 |
Minimum | 1085286 |
---|---|
Maximum | 1085569 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.2 KiB |
Quantile statistics
Minimum | 1085286 |
---|---|
5-th percentile | 1085319.8 |
Q1 | 1085379.2 |
median | 1085494.5 |
Q3 | 1085496.8 |
95-th percentile | 1085568.6 |
Maximum | 1085569 |
Range | 283 |
Interquartile range (IQR) | 117.5 |
Descriptive statistics
Standard deviation | 93.657117 |
---|---|
Coefficient of variation (CV) | 8.6283668 × 10⁻⁵ |
Kurtosis | -0.58388648 |
Mean | 1085455.9 |
Median Absolute Deviation (MAD) | 68.5 |
Skewness | -0.59137155 |
Sum | 10854559 |
Variance | 8771.6556 |
Monotonicity | Strictly increasing |
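The figures above can be recomputed from the ten non-missing values the report lists for this variable. The sketch below uses Python's standard `statistics` module; the choice of the sample (n - 1) variance and of linear-interpolation quantiles is an assumption, made because those choices reproduce the reported numbers.

```python
import statistics

# The ten non-missing bcast_seq_no values listed in the report.
values = [1085286, 1085361, 1085362, 1085431, 1085494,
          1085495, 1085496, 1085497, 1085568, 1085569]

mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance (n - 1 denominator)
std_dev = statistics.stdev(values)       # square root of the sample variance
cv = std_dev / mean                      # coefficient of variation

# "inclusive" interpolates linearly between order statistics.
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
iqr = q3 - q1

# Median absolute deviation, taken around the median.
mad = statistics.median(abs(v - median) for v in values)

print(mean, variance, std_dev, cv, q1, median, q3, iqr, mad)
```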
Value | Count | Frequency (%) |
1085286 | 1 | 0.8% |
1085361 | 1 | 0.8% |
1085362 | 1 | 0.8% |
1085431 | 1 | 0.8% |
1085494 | 1 | 0.8% |
1085495 | 1 | 0.8% |
1085496 | 1 | 0.8% |
1085497 | 1 | 0.8% |
1085568 | 1 | 0.8% |
1085569 | 1 | 0.8% |
(Missing) | 114 | 91.9% |
Minimum 10 values
Value | Count | Frequency (%) |
1085286 | 1 | |
1085361 | 1 | |
1085362 | 1 | |
1085431 | 1 | |
1085494 | 1 | |
1085495 | 1 | |
1085496 | 1 | |
1085497 | 1 | |
1085568 | 1 | |
1085569 | 1 |
Maximum 10 values
Value | Count | Frequency (%) |
1085569 | 1 | |
1085568 | 1 | |
1085497 | 1 | |
1085496 | 1 | |
1085495 | 1 | |
1085494 | 1 | |
1085431 | 1 | |
1085362 | 1 | |
1085361 | 1 | |
1085286 | 1 |
play_sec
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 10 |
---|---|
Distinct (%) | 100.0% |
Missing | 114 |
Missing (%) | 91.9% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 239.6 |
Minimum | 27 |
---|---|
Maximum | 1079 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.2 KiB |
Quantile statistics
Minimum | 27 |
---|---|
5-th percentile | 29.7 |
Q1 | 37 |
median | 55 |
Q3 | 207 |
95-th percentile | 893.6 |
Maximum | 1079 |
Range | 1052 |
Interquartile range (IQR) | 170 |
Descriptive statistics
Standard deviation | 353.80666 |
---|---|
Coefficient of variation (CV) | 1.4766555 |
Kurtosis | 3.171038 |
Mean | 239.6 |
Median Absolute Deviation (MAD) | 25 |
Skewness | 1.9517025 |
Sum | 2396 |
Variance | 125179.16 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
59 | 1 | 0.8% |
667 | 1 | 0.8% |
186 | 1 | 0.8% |
33 | 1 | 0.8% |
34 | 1 | 0.8% |
1079 | 1 | 0.8% |
46 | 1 | 0.8% |
214 | 1 | 0.8% |
27 | 1 | 0.8% |
51 | 1 | 0.8% |
(Missing) | 114 | 91.9% |
Minimum 10 values
Value | Count | Frequency (%) |
27 | 1 | |
33 | 1 | |
34 | 1 | |
46 | 1 | |
51 | 1 | |
59 | 1 | |
186 | 1 | |
214 | 1 | |
667 | 1 | |
1079 | 1 |
Maximum 10 values
Value | Count | Frequency (%) |
1079 | 1 | |
667 | 1 | |
214 | 1 | |
186 | 1 | |
59 | 1 | |
51 | 1 | |
46 | 1 | |
34 | 1 | |
33 | 1 | |
27 | 1 |
play_hour
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 10 |
---|---|
Distinct (%) | 100.0% |
Missing | 114 |
Missing (%) | 91.9% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 0.06656 |
Minimum | 0.0075 |
---|---|
Maximum | 0.2997 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.2 KiB |
Quantile statistics
Minimum | 0.0075 |
---|---|
5-th percentile | 0.008265 |
Q1 | 0.01025 |
median | 0.0153 |
Q3 | 0.057475 |
95-th percentile | 0.24822 |
Maximum | 0.2997 |
Range | 0.2922 |
Interquartile range (IQR) | 0.047225 |
Descriptive statistics
Standard deviation | 0.098273306 |
---|---|
Coefficient of variation (CV) | 1.4764619 |
Kurtosis | 3.1700556 |
Mean | 0.06656 |
Median Absolute Deviation (MAD) | 0.00695 |
Skewness | 1.9515842 |
Sum | 0.6656 |
Variance | 0.0096576427 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
0.0164 | 1 | 0.8% |
0.1853 | 1 | 0.8% |
0.0517 | 1 | 0.8% |
0.0092 | 1 | 0.8% |
0.0094 | 1 | 0.8% |
0.2997 | 1 | 0.8% |
0.0128 | 1 | 0.8% |
0.0594 | 1 | 0.8% |
0.0075 | 1 | 0.8% |
0.0142 | 1 | 0.8% |
(Missing) | 114 | 91.9% |
Minimum 10 values
Value | Count | Frequency (%) |
0.0075 | 1 | |
0.0092 | 1 | |
0.0094 | 1 | |
0.0128 | 1 | |
0.0142 | 1 | |
0.0164 | 1 | |
0.0517 | 1 | |
0.0594 | 1 | |
0.1853 | 1 | |
0.2997 | 1 |
Maximum 10 values
Value | Count | Frequency (%) |
0.2997 | 1 | |
0.1853 | 1 | |
0.0594 | 1 | |
0.0517 | 1 | |
0.0164 | 1 | |
0.0142 | 1 | |
0.0128 | 1 | |
0.0094 | 1 | |
0.0092 | 1 | |
0.0075 | 1 |
file_size
Real number (ℝ)
HIGH CORRELATION
  MISSING
 
Distinct | 10 |
---|---|
Distinct (%) | 100.0% |
Missing | 114 |
Missing (%) | 91.9% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 38938598 |
Minimum | 4447308 |
---|---|
Maximum | 1.6952648 × 10⁸ |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 1.2 KiB |
Quantile statistics
Minimum | 4447308 |
---|---|
5-th percentile | 4888862.4 |
Q1 | 6404105.8 |
median | 8935672.5 |
Q3 | 34356230 |
95-th percentile | 1.434459 × 10⁸ |
Maximum | 1.6952648 × 10⁸ |
Range | 1.6507917 × 10⁸ |
Interquartile range (IQR) | 27952125 |
Descriptive statistics
Standard deviation | 56339606 |
---|---|
Coefficient of variation (CV) | 1.4468833 |
Kurtosis | 2.7024016 |
Mean | 38938598 |
Median Absolute Deviation (MAD) | 3997748.5 |
Skewness | 1.8724102 |
Sum | 3.8938598 × 10⁸ |
Variance | 3.1741512 × 10¹⁵ |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
9494006 | 1 | 0.8% |
111569625 | 1 | 0.8% |
31426768 | 1 | 0.8% |
5428540 | 1 | 0.8% |
5916616 | 1 | 0.8% |
169526481 | 1 | 0.8% |
7866575 | 1 | 0.8% |
35332718 | 1 | 0.8% |
4447308 | 1 | 0.8% |
8377339 | 1 | 0.8% |
(Missing) | 114 | 91.9% |
Minimum 10 values
Value | Count | Frequency (%) |
4447308 | 1 | |
5428540 | 1 | |
5916616 | 1 | |
7866575 | 1 | |
8377339 | 1 | |
9494006 | 1 | |
31426768 | 1 | |
35332718 | 1 | |
111569625 | 1 | |
169526481 | 1 |
Maximum 10 values
Value | Count | Frequency (%) |
169526481 | 1 | |
111569625 | 1 | |
35332718 | 1 | |
31426768 | 1 | |
9494006 | 1 | |
8377339 | 1 | |
7866575 | 1 | |
5916616 | 1 | |
5428540 | 1 | |
4447308 | 1 |
vod_path
Text
MISSING
 
Distinct | 10 |
---|---|
Distinct (%) | 100.0% |
Missing | 114 |
Missing (%) | 91.9% |
Memory size | 1.1 KiB |
Length
Max length | 61 |
---|---|
Median length | 61 |
Mean length | 61 |
Min length | 61 |
Characters and Unicode
Total characters | 610 |
---|---|
Distinct characters | 20 |
Distinct categories | 4 |
Distinct scripts | 2 |
Distinct blocks | 1 |
Unique
Unique | 10 |
---|---|
Unique (%) | 100.0% |
Sample
1st row | /mbnvod2/673/2014/12/01/20141201103143_20_673_1085286_360.mp4 |
---|---|
2nd row | /mbnvod2/673/2014/12/02/20141202104424_20_673_1085361_360.mp4 |
3rd row | /mbnvod2/673/2014/12/02/20141202104424_20_673_1085362_360.mp4 |
4th row | /mbnvod2/673/2014/12/03/20141203101507_20_673_1085431_360.mp4 |
5th row | /mbnvod2/673/2014/12/04/20141204101630_20_673_1085494_360.mp4 |
Value | Count | Frequency (%) |
mbnvod2/673/2014/12/01/20141201103143_20_673_1085286_360.mp4 | 1 | |
mbnvod2/673/2014/12/02/20141202104424_20_673_1085361_360.mp4 | 1 | |
mbnvod2/673/2014/12/02/20141202104424_20_673_1085362_360.mp4 | 1 | |
mbnvod2/673/2014/12/03/20141203101507_20_673_1085431_360.mp4 | 1 | |
mbnvod2/673/2014/12/04/20141204101630_20_673_1085494_360.mp4 | 1 | |
mbnvod2/673/2014/12/04/20141204102655_20_673_1085495_360.mp4 | 1 | |
mbnvod2/673/2014/12/04/20141204102655_20_673_1085496_360.mp4 | 1 | |
mbnvod2/673/2014/12/04/20141204102431_20_673_1085497_360.mp4 | 1 | |
mbnvod2/673/2014/12/05/20141205103252_20_673_1085568_360.mp4 | 1 | |
mbnvod2/673/2014/12/05/20141205102302_20_673_1085569_360.mp4 | 1 |
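Every path shares one fixed 61-character layout, and the second-to-last number in each file name matches a value in the bcast_seq_no column. The sketch below splits a path into its parts with a regular expression. All field names are hypothetical labels inferred from the samples, not documented by the data provider.

```python
import re
from datetime import datetime

# Field names are hypothetical labels inferred from the sample paths.
VOD_PATH = re.compile(
    r"^/(?P<root>[^/]+)/(?P<program>\d+)"
    r"/(?P<year>\d{4})/(?P<month>\d{2})/(?P<day>\d{2})"
    r"/(?P<timestamp>\d{14})_(?P<code>\d+)_(?P<program_again>\d+)"
    r"_(?P<seq_no>\d+)_(?P<quality>\d+)\.mp4$"
)

def parse_vod_path(path: str) -> dict:
    """Split a vod_path value into named parts; raise if the layout differs."""
    match = VOD_PATH.match(path)
    if match is None:
        raise ValueError(f"unexpected vod_path layout: {path!r}")
    fields = match.groupdict()
    # The 14-digit prefix reads as a YYYYMMDDHHMMSS timestamp.
    fields["timestamp"] = datetime.strptime(fields["timestamp"], "%Y%m%d%H%M%S")
    fields["seq_no"] = int(fields["seq_no"])
    return fields

info = parse_vod_path(
    "/mbnvod2/673/2014/12/01/20141201103143_20_673_1085286_360.mp4"
)
print(info["seq_no"], info["timestamp"])
```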
Most occurring characters
Value | Count | Frequency (%) |
0 | 83 | |
2 | 75 | |
1 | 68 | |
/ | 60 | |
4 | 52 | |
3 | 41 | 6.7% |
_ | 40 | 6.6% |
6 | 39 | 6.4% |
5 | 23 | 3.8% |
7 | 22 | 3.6% |
Other values (10) | 107 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 420 | 68.9% |
Lowercase Letter | 80 | 13.1% |
Other Punctuation | 70 | 11.5% |
Connector Punctuation | 40 | 6.6% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
0 | 83 | |
2 | 75 | |
1 | 68 | |
4 | 52 | |
3 | 41 | |
6 | 39 | |
5 | 23 | 5.5% |
7 | 22 | 5.2% |
8 | 12 | 2.9% |
9 | 5 | 1.2% |
Lowercase Letter
Value | Count | Frequency (%) |
m | 20 | |
d | 10 | |
o | 10 | |
v | 10 | |
n | 10 | |
b | 10 | |
p | 10 |
Other Punctuation
Value | Count | Frequency (%) |
/ | 60 | |
. | 10 | 14.3% |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 40 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 530 | 86.9% |
Latin | 80 | 13.1% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
0 | 83 | |
2 | 75 | |
1 | 68 | |
/ | 60 | |
4 | 52 | |
3 | 41 | |
_ | 40 | |
6 | 39 | |
5 | 23 | 4.3% |
7 | 22 | 4.2% |
Other values (3) | 27 | 5.1% |
Latin
Value | Count | Frequency (%) |
m | 20 | |
d | 10 | |
o | 10 | |
v | 10 | |
n | 10 | |
b | 10 | |
p | 10 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 610 | 100.0% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 83 | |
2 | 75 | |
1 | 68 | |
/ | 60 | |
4 | 52 | |
3 | 41 | 6.7% |
_ | 40 | 6.6% |
6 | 39 | 6.4% |
5 | 23 | 3.8% |
7 | 22 | 3.6% |
Other values (10) | 107 |
title
Text
MISSING
 
Distinct | 8 |
---|---|
Distinct (%) | 80.0% |
Missing | 114 |
Missing (%) | 91.9% |
Memory size | 1.1 KiB |
Length
Max length | 27 |
---|---|
Median length | 26 |
Mean length | 17.8 |
Min length | 11 |
Characters and Unicode
Total characters | 178 |
---|---|
Distinct characters | 75 |
Distinct categories | 9 |
Distinct scripts | 3 |
Distinct blocks | 4 |
Unique
Unique | 7 |
---|---|
Unique (%) | 70.0% |
Sample
1st row | (이슈 파해치기)오프닝 |
---|---|
2nd row | 인물 파헤치기-'표현의 자유 vs 종북 발언' 1 |
3rd row | 인물 파헤치기-'표현의 자유 vs 종북 발언' 2 |
4th row | <뉴스 파이터> 오프닝 |
5th row | <뉴스 파이터> 오프닝 |
Value | Count | Frequency (%) |
뉴스 | 3 | 6.8% |
오프닝 | 3 | 6.8% |
파이터 | 3 | 6.8% |
인물 | 3 | 6.8% |
자유 | 2 | 4.5% |
발언 | 2 | 4.5% |
vs | 2 | 4.5% |
종북 | 2 | 4.5% |
파헤치기-'표현의 | 2 | 4.5% |
유출 | 1 | 2.3% |
Other values (21) | 21 |
Most occurring characters
Value | Count | Frequency (%) |
(space) | 34 | 19.1% |
파 | 7 | 3.9% |
' | 6 | 3.4% |
오 | 5 | 2.8% |
프 | 4 | 2.2% |
기 | 4 | 2.2% |
닝 | 4 | 2.2% |
치 | 4 | 2.2% |
이 | 4 | 2.2% |
인 | 3 | 1.7% |
Other values (65) | 103 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 117 | 65.7% |
Space Separator | 34 | 19.1% |
Other Punctuation | 10 | 5.6% |
Math Symbol | 6 | 3.4% |
Lowercase Letter | 4 | 2.2% |
Dash Punctuation | 3 | 1.7% |
Decimal Number | 2 | 1.1% |
Close Punctuation | 1 | 0.6% |
Open Punctuation | 1 | 0.6% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
파 | 7 | 6.0% |
오 | 5 | 4.3% |
프 | 4 | 3.4% |
기 | 4 | 3.4% |
닝 | 4 | 3.4% |
치 | 4 | 3.4% |
이 | 4 | 3.4% |
인 | 3 | 2.6% |
뉴 | 3 | 2.6% |
의 | 3 | 2.6% |
Other values (52) | 76 |
Other Punctuation
Value | Count | Frequency (%) |
' | 6 | |
… | 2 | 20.0% |
· | 2 | 20.0% |
Math Symbol
Value | Count | Frequency (%) |
< | 3 | |
> | 3 |
Lowercase Letter
Value | Count | Frequency (%) |
s | 2 | |
v | 2 |
Decimal Number
Value | Count | Frequency (%) |
1 | 1 | |
2 | 1 |
Space Separator
Value | Count | Frequency (%) |
(space) | 34 | 100.0% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 3 |
Close Punctuation
Value | Count | Frequency (%) |
) | 1 |
Open Punctuation
Value | Count | Frequency (%) |
( | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 117 | 65.7% |
Common | 57 | 32.0% |
Latin | 4 | 2.2% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
파 | 7 | 6.0% |
오 | 5 | 4.3% |
프 | 4 | 3.4% |
기 | 4 | 3.4% |
닝 | 4 | 3.4% |
치 | 4 | 3.4% |
이 | 4 | 3.4% |
인 | 3 | 2.6% |
뉴 | 3 | 2.6% |
의 | 3 | 2.6% |
Other values (52) | 76 |
Common
Value | Count | Frequency (%) |
(space) | 34 | 59.6% |
' | 6 | 10.5% |
- | 3 | 5.3% |
< | 3 | 5.3% |
> | 3 | 5.3% |
… | 2 | 3.5% |
· | 2 | 3.5% |
) | 1 | 1.8% |
1 | 1 | 1.8% |
2 | 1 | 1.8% |
Latin
Value | Count | Frequency (%) |
s | 2 | |
v | 2 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 117 | 65.7% |
ASCII | 57 | 32.0% |
Punctuation | 2 | 1.1% |
None | 2 | 1.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 34 | 59.6% |
' | 6 | 10.5% |
- | 3 | 5.3% |
< | 3 | 5.3% |
> | 3 | 5.3% |
s | 2 | 3.5% |
v | 2 | 3.5% |
) | 1 | 1.8% |
1 | 1 | 1.8% |
2 | 1 | 1.8% |
Hangul
Value | Count | Frequency (%) |
파 | 7 | 6.0% |
오 | 5 | 4.3% |
프 | 4 | 3.4% |
기 | 4 | 3.4% |
닝 | 4 | 3.4% |
치 | 4 | 3.4% |
이 | 4 | 3.4% |
인 | 3 | 2.6% |
뉴 | 3 | 2.6% |
의 | 3 | 2.6% |
Other values (52) | 76 |
Punctuation
Value | Count | Frequency (%) |
… | 2 |
None
Value | Count | Frequency (%) |
· | 2 |
contents
Text
MISSING
 
Distinct | 9 |
---|---|
Distinct (%) | 90.0% |
Missing | 114 |
Missing (%) | 91.9% |
Memory size | 1.1 KiB |
Length
Max length | 67 |
---|---|
Median length | 27 |
Mean length | 33 |
Min length | 6 |
Characters and Unicode
Total characters | 330 |
---|---|
Distinct characters | 122 |
Distinct categories | 7 |
Distinct scripts | 2 |
Distinct blocks | 3 |
Unique
Unique | 8 |
---|---|
Unique (%) | 80.0% |
Sample
1st row | 안녕하세요? |
---|---|
2nd row | 인물을 집중 해부해보는 코너, 인물 파헤치기입니다. |
3rd row | 인물을 집중 해부해보는 코너, 인물 파헤치기입니다. |
4th row | 시청자 여러분 안녕하십니까? 대통령의 측근, 비선, 동생과 관련된 얘기가 오늘 내린 하얀 눈처럼 세상을 덮고 있습니다. |
5th row | 시청자 여러분 안녕하십니까. 청와대에서 작성된 문건을 과연 누가 왜 유출했는지를 두고 온통 추측이 난무합니다. |
Value | Count | Frequency (%) |
시청자 | 3 | 4.1% |
안녕하십니까 | 3 | 4.1% |
여러분 | 3 | 4.1% |
인물을 | 2 | 2.7% |
집중 | 2 | 2.7% |
있습니다 | 2 | 2.7% |
파헤치기입니다 | 2 | 2.7% |
인물 | 2 | 2.7% |
코너 | 2 | 2.7% |
해부해보는 | 2 | 2.7% |
Other values (51) | 51 |
Most occurring characters
Value | Count | Frequency (%) |
(space) | 70 | 21.2% |
니 | 9 | 2.7% |
이 | 8 | 2.4% |
. | 8 | 2.4% |
다 | 7 | 2.1% |
시 | 5 | 1.5% |
하 | 5 | 1.5% |
해 | 5 | 1.5% |
, | 5 | 1.5% |
인 | 4 | 1.2% |
Other values (112) | 204 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 235 | 71.2% |
Space Separator | 70 | 21.2% |
Other Punctuation | 17 | 5.2% |
Math Symbol | 4 | 1.2% |
Decimal Number | 2 | 0.6% |
Open Punctuation | 1 | 0.3% |
Close Punctuation | 1 | 0.3% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
니 | 9 | 3.8% |
이 | 8 | 3.4% |
다 | 7 | 3.0% |
시 | 5 | 2.1% |
하 | 5 | 2.1% |
해 | 5 | 2.1% |
인 | 4 | 1.7% |
기 | 4 | 1.7% |
녕 | 4 | 1.7% |
안 | 4 | 1.7% |
Other values (102) | 180 |
Other Punctuation
Value | Count | Frequency (%) |
. | 8 | |
, | 5 | |
? | 2 | 11.8% |
' | 2 | 11.8% |
Math Symbol
Value | Count | Frequency (%) |
< | 2 | |
> | 2 |
Space Separator
Value | Count | Frequency (%) |
(space) | 70 | 100.0% |
Decimal Number
Value | Count | Frequency (%) |
1 | 2 |
Open Punctuation
Value | Count | Frequency (%) |
【 | 1 |
Close Punctuation
Value | Count | Frequency (%) |
】 | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 235 | 71.2% |
Common | 95 | 28.8% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
니 | 9 | 3.8% |
이 | 8 | 3.4% |
다 | 7 | 3.0% |
시 | 5 | 2.1% |
하 | 5 | 2.1% |
해 | 5 | 2.1% |
인 | 4 | 1.7% |
기 | 4 | 1.7% |
녕 | 4 | 1.7% |
안 | 4 | 1.7% |
Other values (102) | 180 |
Common
Value | Count | Frequency (%) |
(space) | 70 | 73.7% |
. | 8 | 8.4% |
, | 5 | 5.3% |
? | 2 | 2.1% |
< | 2 | 2.1% |
1 | 2 | 2.1% |
> | 2 | 2.1% |
' | 2 | 2.1% |
【 | 1 | 1.1% |
】 | 1 | 1.1% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 235 | 71.2% |
ASCII | 93 | 28.2% |
None | 2 | 0.6% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
(space) | 70 | 75.3% |
. | 8 | 8.6% |
, | 5 | 5.4% |
? | 2 | 2.2% |
< | 2 | 2.2% |
1 | 2 | 2.2% |
> | 2 | 2.2% |
' | 2 | 2.2% |
Hangul
Value | Count | Frequency (%) |
니 | 9 | 3.8% |
이 | 8 | 3.4% |
다 | 7 | 3.0% |
시 | 5 | 2.1% |
하 | 5 | 2.1% |
해 | 5 | 2.1% |
인 | 4 | 1.7% |
기 | 4 | 1.7% |
녕 | 4 | 1.7% |
안 | 4 | 1.7% |
Other values (102) | 180 |
None
Value | Count | Frequency (%) |
【 | 1 | |
】 | 1 |
Unnamed: 8
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 124 |
---|---|
Missing (%) | 100.0% |
Memory size | 1.2 KiB |
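`Unnamed: 8` is the kind of name pandas assigns to a column with no header, and a fully empty one often comes from a trailing delimiter on each line. Whether that is the cause here is an assumption. The sketch below drops such all-empty columns from parsed CSV rows, using only the standard library and a made-up three-line sample.

```python
import csv
import io

def drop_empty_columns(rows):
    """Remove columns whose every data cell is empty (the header row is ignored)."""
    header, *data = rows
    keep = [
        i for i in range(len(header))
        if any(i < len(row) and row[i].strip() for row in data)
    ]
    return [[row[i] if i < len(row) else "" for i in keep] for row in rows]

# Made-up sample: every line ends with a trailing comma, so parsing yields
# an extra, always-empty third column.
sample = "a,b,\n1,x,\n2,y,\n"
rows = list(csv.reader(io.StringIO(sample)))
cleaned = drop_empty_columns(rows)
print(cleaned)
```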
Variable | vod_seq_no | bcast_seq_no | play_sec | play_hour | file_size | vod_path | title | contents |
---|---|---|---|---|---|---|---|---|
vod_seq_no | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
bcast_seq_no | 1.000 | 1.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 1.000 |
play_sec | 1.000 | 0.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.000 |
play_hour | 1.000 | 0.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.000 |
file_size | 1.000 | 0.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.000 |
vod_path | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 |
title | 1.000 | 0.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.782 |
contents | 1.000 | 1.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.782 | 1.000 |
Variable | bcast_seq_no | play_sec | play_hour | file_size |
---|---|---|---|---|
bcast_seq_no | 1.000 | -0.285 | -0.285 | -0.285 |
play_sec | -0.285 | 1.000 | 1.000 | 1.000 |
play_hour | -0.285 | 1.000 | 1.000 | 1.000 |
file_size | -0.285 | 1.000 | 1.000 | 1.000 |
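The 1.000 correlation between play_sec and play_hour is consistent with play_hour being play_sec divided by 3600 and rounded to four decimals. The sketch below checks this against the reported values. Pairing the two lists in the order the report prints them is an assumption; only the first pair (59, 0.0164) is confirmed by the sample rows.

```python
# (play_sec, play_hour) pairs, paired in the order the report lists them.
pairs = [(59, 0.0164), (667, 0.1853), (186, 0.0517), (33, 0.0092),
         (34, 0.0094), (1079, 0.2997), (46, 0.0128), (214, 0.0594),
         (27, 0.0075), (51, 0.0142)]

SECONDS_PER_HOUR = 3600

# Largest gap between play_sec / 3600 and the reported play_hour.
max_gap = max(abs(sec / SECONDS_PER_HOUR - hour) for sec, hour in pairs)

# A gap of at most half a unit in the fourth decimal place (plus a small
# float tolerance) means each play_hour is play_sec / 3600 rounded to 4 places.
consistent = max_gap <= 0.00005 + 1e-12
print(max_gap, consistent)
```

If the relationship holds for the full dataset, one of the two columns is redundant and could be dropped before modelling.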
First rows
Index | vod_seq_no | bcast_seq_no | play_sec | play_hour | file_size | vod_path | title | contents | Unnamed: 8 |
---|---|---|---|---|---|---|---|---|---|
0 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
1 | 557892 | 1085286 | 59 | 0.0164 | 9494006 | /mbnvod2/673/2014/12/01/20141201103143_20_673_1085286_360.mp4 | (이슈 파해치기)오프닝 | 안녕하세요? | <NA> |
2 | 뉴스를 파헤치고 이슈를 터트리는 뉴스, | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
3 | 뉴스 파이터.. | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
4 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
5 | 진행을 맡은 최중락입니다. | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
6 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
7 | 저와 함께 뉴스를 철저하게 해부해주실 분, 소개합니다. | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
8 | 먼저, 정치 사회 전반에서 벌어지는 시사를 격파해 주실, | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
9 | 뉴스멘토 황장수 미래경영연구소 소장 나오셨고요. | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
Last rows
Index | vod_seq_no | bcast_seq_no | play_sec | play_hour | file_size | vod_path | title | contents | Unnamed: 8 |
---|---|---|---|---|---|---|---|---|---|
114 | <2>추위 뿐 만 아니라 눈 소식도 잦은데요. | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
115 | 충청과 호남지역에는 연일 대설특보가 내려진 가운데 오늘 밤까지 계속해서 눈이 내렸다 그쳤다를 반복하겠습니다. | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
116 | 특히 서해안과 제주산간을 중심으로는 최고 15cm이상의 많은 눈이 예보돼 있는데요. | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
117 | 기온이 낮아 눈이 어는 곳이 있어 교통안전에 각별히 주의하셔야겠습니다. | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
118 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
119 | <주간>당분간 영하권 추위는 계속되겠고요. | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
120 | 다음주 월요일에는 충청 이남에 또 한 차례 눈이 내릴 전망입니다. 날씨였습니다. | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
121 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
122 | (전주원 기상캐스터) | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
123 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> |
Most frequently occurring
Index | vod_seq_no | bcast_seq_no | play_sec | play_hour | file_size | vod_path | title | contents | # duplicates |
---|---|---|---|---|---|---|---|---|---|
3 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 37 |
1 | 【 기자 】 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 4 |
2 | 【 앵커멘트 】 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 3 |
0 | (전주원 기상캐스터) | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 2 |
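The table above lists four repeated row patterns, which matches the duplicate count of 4 in the overview. A minimal sketch of how such a summary can be built with the standard library follows. The rows are a made-up miniature of the pattern above, with `None` standing in for `<NA>`.

```python
from collections import Counter

def duplicate_summary(rows):
    """Return (row, count) for each row occurring more than once, most frequent first."""
    counts = Counter(tuple(row) for row in rows)
    return [(row, n) for row, n in counts.most_common() if n > 1]

# Made-up miniature of the data: None stands in for <NA>.
rows = [
    (None, None),
    ("【 기자 】", None),
    (None, None),
    ("【 기자 】", None),
    ("557892", 1085286),
    (None, None),
]
summary = duplicate_summary(rows)
print(summary)
```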