Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 486 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 19.6 KiB |
Average record size in memory | 41.3 B |
Variable types
Numeric | 1 |
---|---|
DateTime | 1 |
Categorical | 1 |
Text | 2 |
Dataset
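Description | Clippings of newspaper articles related to gambling and the speculative gaming industry. For relevant articles from January 1, 2021 through December 31, 2021, the data gives the publication year, month, and day, the category, the publishing outlet, and the article link. |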
---|---|
Author | 한국도박문제관리센터 |
URL | https://www.data.go.kr/data/15108261/fileData.do |
순번 has unique values | Unique |
Reproduction
Analysis started | 2023-12-12 09:41:42.324662 |
---|---|
Analysis finished | 2023-12-12 09:41:42.845466 |
Duration | 0.52 seconds |
Software version | ydata-profiling v4.5.1 |
Download configuration | config.json |
순번
Real number (ℝ)
UNIQUE
 
Distinct | 486 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 243.5 |
Minimum | 1 |
---|---|
Maximum | 486 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.4 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 25.25 |
Q1 | 122.25 |
median | 243.5 |
Q3 | 364.75 |
95-th percentile | 461.75 |
Maximum | 486 |
Range | 485 |
Interquartile range (IQR) | 242.5 |
Descriptive statistics
Standard deviation | 140.44038 |
---|---|
Coefficient of variation (CV) | 0.5767572 |
Kurtosis | -1.2 |
Mean | 243.5 |
Median Absolute Deviation (MAD) | 121.5 |
Skewness | 0 |
Sum | 118341 |
Variance | 19723.5 |
Monotonicity | Strictly increasing |
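Because 순번 is simply the sequence 1 through 486, the quantile and descriptive statistics above can be checked independently. The following is a minimal sketch using only Python's standard library. It assumes linear-interpolation quantiles and the sample (n - 1) variance, which is what the reported figures are consistent with; it is not taken from the profiling tool's own code.

```python
import math
import statistics

# 순번 is strictly increasing with 486 unique values from 1 to 486.
values = list(range(1, 487))

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance (n - 1 denominator)
std_dev = math.sqrt(variance)
cv = std_dev / mean                     # coefficient of variation

# "inclusive" quantiles interpolate linearly between the minimum and maximum.
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
twentieths = statistics.quantiles(values, n=20, method="inclusive")
p5, p95 = twentieths[0], twentieths[-1]
iqr = q3 - q1

# Median absolute deviation around the median.
mad = statistics.median(abs(v - median) for v in values)

print(total, mean, variance, round(std_dev, 5), round(cv, 7))
print(p5, q1, median, q3, p95, iqr, mad)
```

Under those assumptions the computed sum, mean, variance, quartiles, IQR, and MAD match the table entries.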
Value | Count | Frequency (%) |
1 | 1 | 0.2% |
306 | 1 | 0.2% |
334 | 1 | 0.2% |
333 | 1 | 0.2% |
332 | 1 | 0.2% |
331 | 1 | 0.2% |
330 | 1 | 0.2% |
329 | 1 | 0.2% |
328 | 1 | 0.2% |
327 | 1 | 0.2% |
Other values (476) | 476 |
Value | Count | Frequency (%) |
1 | 1 | 0.2% |
2 | 1 | 0.2% |
3 | 1 | 0.2% |
4 | 1 | 0.2% |
5 | 1 | 0.2% |
6 | 1 | 0.2% |
7 | 1 | 0.2% |
8 | 1 | 0.2% |
9 | 1 | 0.2% |
10 | 1 | 0.2% |
Value | Count | Frequency (%) |
486 | 1 | 0.2% |
485 | 1 | 0.2% |
484 | 1 | 0.2% |
483 | 1 | 0.2% |
482 | 1 | 0.2% |
481 | 1 | 0.2% |
480 | 1 | 0.2% |
479 | 1 | 0.2% |
478 | 1 | 0.2% |
477 | 1 | 0.2% |
일자
Date
Distinct | 211 |
---|---|
Distinct (%) | 43.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.9 KiB |
Minimum | 2021-01-25 00:00:00 |
---|---|
Maximum | 2021-12-31 00:00:00 |
분류
Categorical
Distinct | 7 |
---|---|
Distinct (%) | 1.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.9 KiB |
도박관련 | 227 |
---|---|
도박중독 | 212 |
청소년 도박 | 33 |
도박관련(주식) | 9 |
도박관련(가상자산) | 3 |
Other values (2) | 2 |
Length
Max length | 11 |
---|---|
Median length | 10 |
Mean length | 4.7736626 |
Min length | 4 |
Unique
Unique | 2 |
---|---|
Unique (%) | 0.4% |
Sample
1st row | 도박중독 |
---|---|
2nd row | 도박중독 |
3rd row | 도박관련(주식) |
4th row | 청소년 도박 |
5th row | 도박중독 |
Common Values
Value | Count | Frequency (%) |
도박관련 | 227 | 46.7% |
도박중독 | 212 | 43.6% |
청소년 도박 | 33 | 6.8% |
도박관련(주식) | 9 | 1.9% |
도박관련(가상자산) | 3 | 0.6% |
도박관련(주식·코인) | 1 | 0.2% |
도박관련(코인) | 1 | 0.2% |
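The Frequency (%) column is each count divided by the 486 observations. A minimal sketch of that arithmetic, with the counts copied from the table above and rounding to one decimal place assumed:

```python
# Counts copied from the Common Values table for 분류.
counts = {
    "도박관련": 227,
    "도박중독": 212,
    "청소년 도박": 33,
    "도박관련(주식)": 9,
    "도박관련(가상자산)": 3,
    "도박관련(주식·코인)": 1,
    "도박관련(코인)": 1,
}

total = sum(counts.values())  # should equal the number of observations

# Share of each category, rounded to one decimal place (assumed display rule).
frequency = {value: round(100 * n / total, 1) for value, n in counts.items()}

# Values that occur exactly once are the ones counted under "Unique".
unique_values = [value for value, n in counts.items() if n == 1]

for value, pct in frequency.items():
    print(f"{value} | {counts[value]} | {pct}% |")
```

The two categories with a count of 1 account for the "Unique | 2" entry reported for this variable.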
Words
Value | Count | Frequency (%) |
도박관련 | 227 | 43.7% |
도박중독 | 212 | 40.8% |
청소년 | 33 | 6.4% |
도박 | 33 | 6.4% |
도박관련(주식 | 9 | 1.7% |
도박관련(가상자산 | 3 | 0.6% |
도박관련(주식·코인 | 1 | 0.2% |
도박관련(코인 | 1 | 0.2% |
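The table above counts words rather than whole category values: "청소년 도박" contributes one count each to "청소년" and "도박", and the closing parenthesis is dropped from values such as "도박관련(주식)". The sketch below reproduces those figures under an assumed tokenization rule (split on whitespace, then strip ASCII punctuation from both ends of each word). The rule is inferred from the numbers and is not confirmed against the profiling tool.

```python
import string
from collections import Counter

# Category counts copied from the Common Values table for 분류.
category_counts = {
    "도박관련": 227,
    "도박중독": 212,
    "청소년 도박": 33,
    "도박관련(주식)": 9,
    "도박관련(가상자산)": 3,
    "도박관련(주식·코인)": 1,
    "도박관련(코인)": 1,
}

def tokenize(text: str) -> list[str]:
    """Split on whitespace and strip ASCII punctuation from each end (assumed rule)."""
    return [word.strip(string.punctuation) for word in text.split()]

word_counts = Counter()
for category, n in category_counts.items():
    for word in tokenize(category):
        word_counts[word] += n

# Percentages are relative to the total number of words, not rows.
total_words = sum(word_counts.values())
word_share = {w: round(100 * c / total_words, 1) for w, c in word_counts.items()}

print(total_words)
print(word_counts.most_common())
```

With this rule the denominator becomes 519 words instead of 486 rows, which is why "청소년" shows 6.4% here while "청소년 도박" shows 6.8% in the Common Values table.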
매체
Text
Distinct | 121 |
---|---|
Distinct (%) | 24.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.9 KiB |
Value | Count | Frequency (%) |
뉴스1 | 72 | 14.7% |
연합뉴스 | 60 | 12.3% |
뉴시스 | 45 | 9.2% |
kbs | 28 | 5.7% |
조선일보 | 15 | 3.1% |
mbc | 12 | 2.5% |
프레시안 | 10 | 2.0% |
한국경제 | 9 | 1.8% |
머니투데이 | 9 | 1.8% |
중앙일보 | 9 | 1.8% |
Other values (112) | 220 |
Most occurring characters
Value | Count | Frequency (%) |
스 | 241 | 12.9% |
뉴 | 224 | 12.0% |
일 | 80 | 4.3% |
1 | 72 | 3.8% |
연 | 66 | 3.5% |
합 | 66 | 3.5% |
시 | 63 | 3.4% |
B | 60 | 3.2% |
보 | 59 | 3.2% |
S | 45 | 2.4% |
Other values (147) | 895 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 1558 | 83.3% |
Uppercase Letter | 233 | 12.5% |
Decimal Number | 72 | 3.8% |
Lowercase Letter | 5 | 0.3% |
Space Separator | 3 | 0.2% |
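The category names in this table are Unicode general categories: Hangul syllables fall under Other Letter (`Lo`), digits under Decimal Number (`Nd`), and so on. A minimal sketch of such a tally with Python's `unicodedata` module follows. The helper `category_counts` and the short sample list are illustrative only; the sample uses a few outlet names that appear in this report, not the full 매체 column.

```python
import unicodedata
from collections import Counter

# Readable names for the general-category codes seen in this table.
CATEGORY_NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
}

def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts

# Illustrative sample only, not the full column.
sample = ["뉴스1", "연합뉴스", "뉴시스", "SBS", "MBC", "MBN"]
result = category_counts(sample)
print(result.most_common())
```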
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
스 | 241 | 15.5% |
뉴 | 224 | 14.4% |
일 | 80 | 5.1% |
연 | 66 | 4.2% |
합 | 66 | 4.2% |
시 | 63 | 4.0% |
보 | 59 | 3.8% |
경 | 45 | 2.9% |
이 | 40 | 2.6% |
제 | 34 | 2.2% |
Other values (127) | 640 |
Uppercase Letter
Value | Count | Frequency (%) |
B | 60 | 25.8% |
S | 45 | 19.3% |
K | 28 | 12.0% |
T | 21 | 9.0% |
C | 20 | 8.6% |
M | 19 | 8.2% |
N | 12 | 5.2% |
V | 9 | 3.9% |
Y | 7 | 3.0% |
A | 5 | 2.1% |
Other values (3) | 7 | 3.0% |
Lowercase Letter
Value | Count | Frequency (%) |
c | 1 | 20.0% |
u | 1 | 20.0% |
b | 1 | 20.0% |
i | 1 | 20.0% |
e | 1 | 20.0% |
Decimal Number
Value | Count | Frequency (%) |
1 | 72 | 100.0% |
Space Separator
Value | Count | Frequency (%) |
(space) | 3 | 100.0% |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 1558 | 83.3% |
Latin | 238 | 12.7% |
Common | 75 | 4.0% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
스 | 241 | 15.5% |
뉴 | 224 | 14.4% |
일 | 80 | 5.1% |
연 | 66 | 4.2% |
합 | 66 | 4.2% |
시 | 63 | 4.0% |
보 | 59 | 3.8% |
경 | 45 | 2.9% |
이 | 40 | 2.6% |
제 | 34 | 2.2% |
Other values (127) | 640 |
Latin
Value | Count | Frequency (%) |
B | 60 | 25.2% |
S | 45 | 18.9% |
K | 28 | 11.8% |
T | 21 | 8.8% |
C | 20 | 8.4% |
M | 19 | 8.0% |
N | 12 | 5.0% |
V | 9 | 3.8% |
Y | 7 | 2.9% |
A | 5 | 2.1% |
Other values (8) | 12 | 5.0% |
Common
Value | Count | Frequency (%) |
1 | 72 | 96.0% |
(space) | 3 | 4.0% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 1558 | 83.3% |
ASCII | 313 | 16.7% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
스 | 241 | 15.5% |
뉴 | 224 | 14.4% |
일 | 80 | 5.1% |
연 | 66 | 4.2% |
합 | 66 | 4.2% |
시 | 63 | 4.0% |
보 | 59 | 3.8% |
경 | 45 | 2.9% |
이 | 40 | 2.6% |
제 | 34 | 2.2% |
Other values (127) | 640 |
ASCII
Value | Count | Frequency (%) |
1 | 72 | 23.0% |
B | 60 | 19.2% |
S | 45 | 14.4% |
K | 28 | 8.9% |
T | 21 | 6.7% |
C | 20 | 6.4% |
M | 19 | 6.1% |
N | 12 | 3.8% |
V | 9 | 2.9% |
Y | 7 | 2.2% |
Other values (10) | 20 | 6.4% |
기사링크
Text
Distinct | 483 |
---|---|
Distinct (%) | 99.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 3.9 KiB |
Length
Max length | 151 |
---|---|
Median length | 140 |
Mean length | 60.156379 |
Min length | 26 |
Characters and Unicode
Total characters | 29236 |
---|---|
Distinct characters | 78 |
Distinct categories | 11 |
Distinct scripts | 3 |
Distinct blocks | 3 |
Unique
Unique | 480 |
---|---|
Unique (%) | 98.8% |
Sample
1st row | http://www.gnmaeil.com/news/articleView.html?idxno=465015 |
---|---|
2nd row | https://www.idaegu.co.kr/news/articleView.html?idxno=335553 |
3rd row | https://n.news.naver.com/mnews/article/001/0012158419?sid=102 |
4th row | https://www.etoday.co.kr/news/view/1987452 |
5th row | https://n.news.naver.com/mnews/article/421/0005127370?sid=102 |
Value | Count | Frequency (%) |
https://www.yna.co.kr/view/akr20210725047200053?input=1195m | 2 | 0.4% |
https://www.chosun.com/national/incident/2021/07/25/jeidwxw5p5a4zi6atqoiyfwq6e/?utm_source=naver&utm_medium=referral&utm_campaign=naver-news | 2 | 0.4% |
https://n.news.naver.com/mnews/article/052/0001543766?sid=102 | 2 | 0.4% |
http://www.idomin.com/news/articleview.html?idxno=781258 | 1 | 0.2% |
https://www.hani.co.kr/arti/area/honam/1010830.html | 1 | 0.2% |
https://www.hankyung.com/society/article/2021090659111 | 1 | 0.2% |
http://monthly.chosun.com/client/news/viw.asp?ctcd=e&nnewsnumb=202109100039 | 1 | 0.2% |
https://www.news1.kr/articles/?4420949 | 1 | 0.2% |
https://www.etoday.co.kr/news/view/2058091 | 1 | 0.2% |
https://www.nocutnews.co.kr/news/5616825 | 1 | 0.2% |
Other values (473) | 473 |
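The difference between "Distinct" (483) and "Unique" (480) for 기사링크 comes from the three links that each appear twice: distinct counts every different value once, while unique counts only values that occur exactly once. A minimal sketch of that distinction, using a hypothetical helper `distinct_and_unique` and placeholder links rather than the real column:

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for n in counts.values() if n == 1)
    return distinct, unique

# Placeholder links for illustration; one of them is repeated.
links = [
    "https://example.com/a",
    "https://example.com/a",
    "https://example.com/b",
    "https://example.com/c",
]
distinct, unique = distinct_and_unique(links)
print(distinct, unique)

# Consistency check against the figures reported for 기사링크.
duplicated_links = 3   # links listed with a count of 2
unique_links = 480
report_distinct = unique_links + duplicated_links
report_rows = unique_links + 2 * duplicated_links
print(report_distinct, report_rows)
```

The same arithmetic ties the report together: 480 unique links plus 3 duplicated links give 483 distinct values, and 480 + 2 × 3 gives the 486 observations.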
Most occurring characters
Value | Count | Frequency (%) |
/ | 2222 | 7.6% |
w | 1713 | 5.9% |
t | 1640 | 5.6% |
0 | 1608 | 5.5% |
e | 1401 | 4.8% |
. | 1310 | 4.5% |
1 | 1294 | 4.4% |
s | 1283 | 4.4% |
n | 1073 | 3.7% |
2 | 1060 | 3.6% |
Other values (68) | 14632 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 15879 | 54.3% |
Decimal Number | 6786 | 23.2% |
Other Punctuation | 4597 | 15.7% |
Uppercase Letter | 1232 | 4.2% |
Math Symbol | 499 | 1.7% |
Connector Punctuation | 211 | 0.7% |
Dash Punctuation | 22 | 0.1% |
Space Separator | 4 | < 0.1% |
Other Letter | 3 | < 0.1% |
Close Punctuation | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
w | 1713 | 10.8% |
t | 1640 | 10.3% |
e | 1401 | 8.8% |
s | 1283 | 8.1% |
n | 1073 | 6.8% |
i | 1025 | 6.5% |
c | 916 | 5.8% |
o | 797 | 5.0% |
r | 782 | 4.9% |
p | 773 | 4.9% |
Other values (16) | 4476 |
Uppercase Letter
Value | Count | Frequency (%) |
I | 169 | 13.7% |
N | 116 | 9.4% |
A | 116 | 9.4% |
D | 109 | 8.8% |
R | 92 | 7.5% |
V | 85 | 6.9% |
K | 76 | 6.2% |
S | 64 | 5.2% |
X | 53 | 4.3% |
P | 33 | 2.7% |
Other values (16) | 319 |
Decimal Number
Value | Count | Frequency (%) |
0 | 1608 | 23.7% |
1 | 1294 | 19.1% |
2 | 1060 | 15.6% |
5 | 504 | 7.4% |
4 | 474 | 7.0% |
3 | 452 | 6.7% |
8 | 365 | 5.4% |
6 | 363 | 5.3% |
9 | 356 | 5.2% |
7 | 310 | 4.6% |
Other Punctuation
Value | Count | Frequency (%) |
/ | 2222 | 48.3% |
. | 1310 | 28.5% |
: | 486 | 10.6% |
? | 369 | 8.0% |
& | 201 | 4.4% |
# | 8 | 0.2% |
' | 1 | < 0.1% |
Other Letter
Value | Count | Frequency (%) |
말 | 1 | |
거 | 1 | |
고 | 1 |
Math Symbol
Value | Count | Frequency (%) |
= | 499 |
Connector Punctuation
Value | Count | Frequency (%) |
_ | 211 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 22 |
Space Separator
Value | Count | Frequency (%) |
(space) | 4 | 100.0% |
Close Punctuation
Value | Count | Frequency (%) |
) | 2 |
Final Punctuation
Value | Count | Frequency (%) |
’ | 1 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 17111 | 58.5% |
Common | 12122 | 41.5% |
Hangul | 3 | < 0.1% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
w | 1713 | 10.0% |
t | 1640 | 9.6% |
e | 1401 | 8.2% |
s | 1283 | 7.5% |
n | 1073 | 6.3% |
i | 1025 | 6.0% |
c | 916 | 5.4% |
o | 797 | 4.7% |
r | 782 | 4.6% |
p | 773 | 4.5% |
Other values (42) | 5708 |
Common
Value | Count | Frequency (%) |
/ | 2222 | 18.3% |
0 | 1608 | 13.3% |
. | 1310 | 10.8% |
1 | 1294 | 10.7% |
2 | 1060 | 8.7% |
5 | 504 | 4.2% |
= | 499 | 4.1% |
: | 486 | 4.0% |
4 | 474 | 3.9% |
3 | 452 | 3.7% |
Other values (13) | 2213 |
Hangul
Value | Count | Frequency (%) |
말 | 1 | |
거 | 1 | |
고 | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 29232 | > 99.9% |
Hangul | 3 | < 0.1% |
Punctuation | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
/ | 2222 | 7.6% |
w | 1713 | 5.9% |
t | 1640 | 5.6% |
0 | 1608 | 5.5% |
e | 1401 | 4.8% |
. | 1310 | 4.5% |
1 | 1294 | 4.4% |
s | 1283 | 4.4% |
n | 1073 | 3.7% |
2 | 1060 | 3.6% |
Other values (64) | 14628 |
Hangul
Value | Count | Frequency (%) |
말 | 1 | |
거 | 1 | |
고 | 1 |
Punctuation
Value | Count | Frequency (%) |
’ | 1 |
Variable | 순번 | 분류 |
---|---|---|
순번 | 1.000 | 0.640 |
분류 | 0.640 | 1.000 |
Variable | 순번 | 분류 |
---|---|---|
순번 | 1.000 | 0.391 |
분류 | 0.391 | 1.000 |
First rows
Index | 순번 | 일자 | 분류 | 매체 | 기사링크 |
---|---|---|---|---|---|
0 | 1 | 2021-01-25 | 도박중독 | 경남매일 | http://www.gnmaeil.com/news/articleView.html?idxno=465015 |
1 | 2 | 2021-01-25 | 도박중독 | 대구신문 | https://www.idaegu.co.kr/news/articleView.html?idxno=335553 |
2 | 3 | 2021-01-25 | 도박관련(주식) | 연합뉴스 | https://n.news.naver.com/mnews/article/001/0012158419?sid=102 |
3 | 4 | 2021-01-25 | 청소년 도박 | 이투데이 | https://www.etoday.co.kr/news/view/1987452 |
4 | 5 | 2021-01-26 | 도박중독 | 뉴스1 | https://n.news.naver.com/mnews/article/421/0005127370?sid=102 |
5 | 6 | 2021-01-26 | 도박중독 | 연합뉴스 | https://n.news.naver.com/mnews/article/001/0012161324?sid=102 |
6 | 7 | 2021-01-26 | 도박중독 | 뉴시스 | https://n.news.naver.com/mnews/article/003/0010310829?sid=102 |
7 | 8 | 2021-01-26 | 도박중독 | 대구신문 | https://www.idaegu.co.kr/news/articleView.html?idxno=335588 |
8 | 9 | 2021-01-27 | 도박중독 | 한국경제 | https://n.news.naver.com/mnews/article/015/0004490002?sid=102 |
9 | 10 | 2021-01-27 | 도박중독 | 로이슈 | https://ccnews.lawissue.co.kr/view.php?ud=2021012609323742456cf2d78c68_12 |
Last rows
Index | 순번 | 일자 | 분류 | 매체 | 기사링크 |
---|---|---|---|---|---|
476 | 477 | 2021-12-27 | 도박중독 | 뉴스1 | https://www.news1.kr/articles/?4534407 |
477 | 478 | 2021-12-27 | 도박관련 | 연합뉴스 | https://www.yna.co.kr/view/AKR20211224129000052?input=1195m |
478 | 479 | 2021-12-27 | 도박관련 | 뉴스1 | https://www.news1.kr/articles/?4532167 |
479 | 480 | 2021-12-29 | 도박관련 | SBS | https://news.sbs.co.kr/news/endPage.do?news_id=N1006585239&plink=ORI&cooper=NAVER |
480 | 481 | 2021-12-29 | 도박관련 | MBC | https://imnews.imbc.com/replay/2021/nwtoday/article/6328015_34943.html |
481 | 482 | 2021-12-29 | 도박관련 | 중앙일보 | https://www.joongang.co.kr/article/25036476 |
482 | 483 | 2021-12-29 | 도박관련 | 미디어펜 | http://www.mediapen.com/news/view/689398 |
483 | 484 | 2021-12-30 | 도박관련 | 뉴스1 | https://www.news1.kr/articles/?4538373 |
484 | 485 | 2021-12-31 | 도박관련 | MBN | https://www.mbn.co.kr/news/society/4670394 |
485 | 486 | 2021-12-31 | 도박관련 | 뉴스1 | https://www.news1.kr/articles/?4539295 |