Overview

Dataset statistics

Number of variables5
Number of observations486
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory19.6 KiB
Average record size in memory41.3 B

Variable types

Numeric1
DateTime1
Categorical1
Text2

Dataset

Description도박 및 사행산업 관련 신문기사 스크랩 자료입니다. 2021년 1월 1일부터 2021년 12월 31일까지 관련 기사의 발행년, 발행월, 발행일과 분류, 발행기관, 기사링크입니다.
Author한국도박문제관리센터
URLhttps://www.data.go.kr/data/15108261/fileData.do

Alerts

순번 has unique valuesUnique

Reproduction

Analysis started2023-12-12 09:41:42.324662
Analysis finished2023-12-12 09:41:42.845466
Duration0.52 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

순번
Real number (ℝ)

UNIQUE 

Distinct486
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean243.5
Minimum1
Maximum486
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size4.4 KiB
2023-12-12T18:41:42.939322image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile25.25
Q1122.25
median243.5
Q3364.75
95-th percentile461.75
Maximum486
Range485
Interquartile range (IQR)242.5

Descriptive statistics

Standard deviation140.44038
Coefficient of variation (CV)0.5767572
Kurtosis-1.2
Mean243.5
Median Absolute Deviation (MAD)121.5
Skewness0
Sum118341
Variance19723.5
MonotonicityStrictly increasing
2023-12-12T18:41:43.105886image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
0.2%
306 1
 
0.2%
334 1
 
0.2%
333 1
 
0.2%
332 1
 
0.2%
331 1
 
0.2%
330 1
 
0.2%
329 1
 
0.2%
328 1
 
0.2%
327 1
 
0.2%
Other values (476) 476
97.9%
ValueCountFrequency (%)
1 1
0.2%
2 1
0.2%
3 1
0.2%
4 1
0.2%
5 1
0.2%
6 1
0.2%
7 1
0.2%
8 1
0.2%
9 1
0.2%
10 1
0.2%
ValueCountFrequency (%)
486 1
0.2%
485 1
0.2%
484 1
0.2%
483 1
0.2%
482 1
0.2%
481 1
0.2%
480 1
0.2%
479 1
0.2%
478 1
0.2%
477 1
0.2%

일자
Date

Distinct211
Distinct (%)43.4%
Missing0
Missing (%)0.0%
Memory size3.9 KiB
Minimum2021-01-25 00:00:00
Maximum2021-12-31 00:00:00
2023-12-12T18:41:43.269622image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T18:41:43.424085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

분류
Categorical

Distinct7
Distinct (%)1.4%
Missing0
Missing (%)0.0%
Memory size3.9 KiB
도박관련
227 
도박중독
212 
청소년 도박
33 
도박관련(주식)
 
9
도박관련(가상자산)
 
3
Other values (2)
 
2

Length

Max length11
Median length10
Mean length4.7736626
Min length4

Unique

Unique2 ?
Unique (%)0.4%

Sample

1st row도박중독
2nd row도박중독
3rd row도박관련(주식)
4th row청소년 도박
5th row도박중독

Common Values

ValueCountFrequency (%)
도박관련 227
46.7%
도박중독 212
43.6%
청소년 도박 33
 
6.8%
도박관련(주식) 9
 
1.9%
도박관련(가상자산) 3
 
0.6%
도박관련(주식·코인) 1
 
0.2%
도박관련(코인) 1
 
0.2%

Length

2023-12-12T18:41:43.560768image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T18:41:43.687733image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
도박관련 227
43.7%
도박중독 212
40.8%
청소년 33
 
6.4%
도박 33
 
6.4%
도박관련(주식 9
 
1.7%
도박관련(가상자산 3
 
0.6%
도박관련(주식·코인 1
 
0.2%
도박관련(코인 1
 
0.2%

매체
Text

Distinct121
Distinct (%)24.9%
Missing0
Missing (%)0.0%
Memory size3.9 KiB
2023-12-12T18:41:43.966716image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length7
Mean length3.8497942
Min length3

Characters and Unicode

Total characters1871
Distinct characters157
Distinct categories5 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique66 ?
Unique (%)13.6%

Sample

1st row경남매일
2nd row대구신문
3rd row연합뉴스
4th row이투데이
5th row뉴스1
ValueCountFrequency (%)
뉴스1 72
 
14.7%
연합뉴스 60
 
12.3%
뉴시스 45
 
9.2%
kbs 28
 
5.7%
조선일보 15
 
3.1%
mbc 12
 
2.5%
프레시안 10
 
2.0%
한국경제 9
 
1.8%
머니투데이 9
 
1.8%
중앙일보 9
 
1.8%
Other values (112) 220
45.0%
2023-12-12T18:41:44.382598image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
241
 
12.9%
224
 
12.0%
80
 
4.3%
1 72
 
3.8%
66
 
3.5%
66
 
3.5%
63
 
3.4%
B 60
 
3.2%
59
 
3.2%
S 45
 
2.4%
Other values (147) 895
47.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1558
83.3%
Uppercase Letter 233
 
12.5%
Decimal Number 72
 
3.8%
Lowercase Letter 5
 
0.3%
Space Separator 3
 
0.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
241
 
15.5%
224
 
14.4%
80
 
5.1%
66
 
4.2%
66
 
4.2%
63
 
4.0%
59
 
3.8%
45
 
2.9%
40
 
2.6%
34
 
2.2%
Other values (127) 640
41.1%
Uppercase Letter
ValueCountFrequency (%)
B 60
25.8%
S 45
19.3%
K 28
12.0%
T 21
 
9.0%
C 20
 
8.6%
M 19
 
8.2%
N 12
 
5.2%
V 9
 
3.9%
Y 7
 
3.0%
A 5
 
2.1%
Other values (3) 7
 
3.0%
Lowercase Letter
ValueCountFrequency (%)
c 1
20.0%
u 1
20.0%
b 1
20.0%
i 1
20.0%
e 1
20.0%
Decimal Number
ValueCountFrequency (%)
1 72
100.0%
Space Separator
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1558
83.3%
Latin 238
 
12.7%
Common 75
 
4.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
241
 
15.5%
224
 
14.4%
80
 
5.1%
66
 
4.2%
66
 
4.2%
63
 
4.0%
59
 
3.8%
45
 
2.9%
40
 
2.6%
34
 
2.2%
Other values (127) 640
41.1%
Latin
ValueCountFrequency (%)
B 60
25.2%
S 45
18.9%
K 28
11.8%
T 21
 
8.8%
C 20
 
8.4%
M 19
 
8.0%
N 12
 
5.0%
V 9
 
3.8%
Y 7
 
2.9%
A 5
 
2.1%
Other values (8) 12
 
5.0%
Common
ValueCountFrequency (%)
1 72
96.0%
3
 
4.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1558
83.3%
ASCII 313
 
16.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
241
 
15.5%
224
 
14.4%
80
 
5.1%
66
 
4.2%
66
 
4.2%
63
 
4.0%
59
 
3.8%
45
 
2.9%
40
 
2.6%
34
 
2.2%
Other values (127) 640
41.1%
ASCII
ValueCountFrequency (%)
1 72
23.0%
B 60
19.2%
S 45
14.4%
K 28
 
8.9%
T 21
 
6.7%
C 20
 
6.4%
M 19
 
6.1%
N 12
 
3.8%
V 9
 
2.9%
Y 7
 
2.2%
Other values (10) 20
 
6.4%
Distinct483
Distinct (%)99.4%
Missing0
Missing (%)0.0%
Memory size3.9 KiB
2023-12-12T18:41:44.646766image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length151
Median length140
Mean length60.156379
Min length26

Characters and Unicode

Total characters29236
Distinct characters78
Distinct categories11 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique480 ?
Unique (%)98.8%

Sample

1st rowhttp://www.gnmaeil.com/news/articleView.html?idxno=465015
2nd rowhttps://www.idaegu.co.kr/news/articleView.html?idxno=335553
3rd rowhttps://n.news.naver.com/mnews/article/001/0012158419?sid=102
4th rowhttps://www.etoday.co.kr/news/view/1987452
5th rowhttps://n.news.naver.com/mnews/article/421/0005127370?sid=102
ValueCountFrequency (%)
https://www.yna.co.kr/view/akr20210725047200053?input=1195m 2
 
0.4%
https://www.chosun.com/national/incident/2021/07/25/jeidwxw5p5a4zi6atqoiyfwq6e/?utm_source=naver&utm_medium=referral&utm_campaign=naver-news 2
 
0.4%
https://n.news.naver.com/mnews/article/052/0001543766?sid=102 2
 
0.4%
http://www.idomin.com/news/articleview.html?idxno=781258 1
 
0.2%
https://www.hani.co.kr/arti/area/honam/1010830.html 1
 
0.2%
https://www.hankyung.com/society/article/2021090659111 1
 
0.2%
http://monthly.chosun.com/client/news/viw.asp?ctcd=e&nnewsnumb=202109100039 1
 
0.2%
https://www.news1.kr/articles/?4420949 1
 
0.2%
https://www.etoday.co.kr/news/view/2058091 1
 
0.2%
https://www.nocutnews.co.kr/news/5616825 1
 
0.2%
Other values (473) 473
97.3%
2023-12-12T18:41:45.033416image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
/ 2222
 
7.6%
w 1713
 
5.9%
t 1640
 
5.6%
0 1608
 
5.5%
e 1401
 
4.8%
. 1310
 
4.5%
1 1294
 
4.4%
s 1283
 
4.4%
n 1073
 
3.7%
2 1060
 
3.6%
Other values (68) 14632
50.0%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter 15879
54.3%
Decimal Number 6786
23.2%
Other Punctuation 4597
 
15.7%
Uppercase Letter 1232
 
4.2%
Math Symbol 499
 
1.7%
Connector Punctuation 211
 
0.7%
Dash Punctuation 22
 
0.1%
Space Separator 4
 
< 0.1%
Other Letter 3
 
< 0.1%
Close Punctuation 2
 
< 0.1%

Most frequent character per category

Lowercase Letter
ValueCountFrequency (%)
w 1713
 
10.8%
t 1640
 
10.3%
e 1401
 
8.8%
s 1283
 
8.1%
n 1073
 
6.8%
i 1025
 
6.5%
c 916
 
5.8%
o 797
 
5.0%
r 782
 
4.9%
p 773
 
4.9%
Other values (16) 4476
28.2%
Uppercase Letter
ValueCountFrequency (%)
I 169
13.7%
N 116
 
9.4%
A 116
 
9.4%
D 109
 
8.8%
R 92
 
7.5%
V 85
 
6.9%
K 76
 
6.2%
S 64
 
5.2%
X 53
 
4.3%
P 33
 
2.7%
Other values (16) 319
25.9%
Decimal Number
ValueCountFrequency (%)
0 1608
23.7%
1 1294
19.1%
2 1060
15.6%
5 504
 
7.4%
4 474
 
7.0%
3 452
 
6.7%
8 365
 
5.4%
6 363
 
5.3%
9 356
 
5.2%
7 310
 
4.6%
Other Punctuation
ValueCountFrequency (%)
/ 2222
48.3%
. 1310
28.5%
: 486
 
10.6%
? 369
 
8.0%
& 201
 
4.4%
# 8
 
0.2%
' 1
 
< 0.1%
Other Letter
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Math Symbol
ValueCountFrequency (%)
= 499
100.0%
Connector Punctuation
ValueCountFrequency (%)
_ 211
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 22
100.0%
Space Separator
ValueCountFrequency (%)
4
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Final Punctuation
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin 17111
58.5%
Common 12122
41.5%
Hangul 3
 
< 0.1%

Most frequent character per script

Latin
ValueCountFrequency (%)
w 1713
 
10.0%
t 1640
 
9.6%
e 1401
 
8.2%
s 1283
 
7.5%
n 1073
 
6.3%
i 1025
 
6.0%
c 916
 
5.4%
o 797
 
4.7%
r 782
 
4.6%
p 773
 
4.5%
Other values (42) 5708
33.4%
Common
ValueCountFrequency (%)
/ 2222
18.3%
0 1608
13.3%
. 1310
10.8%
1 1294
10.7%
2 1060
8.7%
5 504
 
4.2%
= 499
 
4.1%
: 486
 
4.0%
4 474
 
3.9%
3 452
 
3.7%
Other values (13) 2213
18.3%
Hangul
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII 29232
> 99.9%
Hangul 3
 
< 0.1%
Punctuation 1
 
< 0.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
/ 2222
 
7.6%
w 1713
 
5.9%
t 1640
 
5.6%
0 1608
 
5.5%
e 1401
 
4.8%
. 1310
 
4.5%
1 1294
 
4.4%
s 1283
 
4.4%
n 1073
 
3.7%
2 1060
 
3.6%
Other values (64) 14628
50.0%
Hangul
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%
Punctuation
ValueCountFrequency (%)
1
100.0%

Interactions

2023-12-12T18:41:42.573137image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T18:41:45.119468image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번분류
순번1.0000.640
분류0.6401.000
2023-12-12T18:41:45.261312image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
순번분류
순번1.0000.391
분류0.3911.000

Missing values

2023-12-12T18:41:42.691521image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T18:41:42.799900image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

순번일자분류매체기사링크
012021-01-25도박중독경남매일http://www.gnmaeil.com/news/articleView.html?idxno=465015
122021-01-25도박중독대구신문https://www.idaegu.co.kr/news/articleView.html?idxno=335553
232021-01-25도박관련(주식)연합뉴스https://n.news.naver.com/mnews/article/001/0012158419?sid=102
342021-01-25청소년 도박이투데이https://www.etoday.co.kr/news/view/1987452
452021-01-26도박중독뉴스1https://n.news.naver.com/mnews/article/421/0005127370?sid=102
562021-01-26도박중독연합뉴스https://n.news.naver.com/mnews/article/001/0012161324?sid=102
672021-01-26도박중독뉴시스https://n.news.naver.com/mnews/article/003/0010310829?sid=102
782021-01-26도박중독대구신문https://www.idaegu.co.kr/news/articleView.html?idxno=335588
892021-01-27도박중독한국경제https://n.news.naver.com/mnews/article/015/0004490002?sid=102
9102021-01-27도박중독로이슈https://ccnews.lawissue.co.kr/view.php?ud=2021012609323742456cf2d78c68_12
순번일자분류매체기사링크
4764772021-12-27도박중독뉴스1https://www.news1.kr/articles/?4534407
4774782021-12-27도박관련연합뉴스https://www.yna.co.kr/view/AKR20211224129000052?input=1195m
4784792021-12-27도박관련뉴스1https://www.news1.kr/articles/?4532167
4794802021-12-29도박관련SBShttps://news.sbs.co.kr/news/endPage.do?news_id=N1006585239&plink=ORI&cooper=NAVER
4804812021-12-29도박관련MBChttps://imnews.imbc.com/replay/2021/nwtoday/article/6328015_34943.html
4814822021-12-29도박관련중앙일보https://www.joongang.co.kr/article/25036476
4824832021-12-29도박관련미디어펜http://www.mediapen.com/news/view/689398
4834842021-12-30도박관련뉴스1https://www.news1.kr/articles/?4538373
4844852021-12-31도박관련MBNhttps://www.mbn.co.kr/news/society/4670394
4854862021-12-31도박관련뉴스1https://www.news1.kr/articles/?4539295