Overview

Dataset statistics

Number of variables: 8
Number of observations: 90
Missing cells: 28
Missing cells (%): 3.9%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 6.1 KiB
Average record size in memory: 69.5 B
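
The missing-cell percentages follow directly from the counts above. A minimal sketch of the arithmetic, assuming (as the per-variable sections report) that all 28 missing cells belong to the ISBN column; the variable names are illustrative only:

```python
# Figures copied from the Dataset statistics block.
n_variables = 8
n_observations = 90
missing_cells = 28

total_cells = n_variables * n_observations                # 720 cells in the table
missing_pct = 100 * missing_cells / total_cells           # share of all cells
isbn_missing_pct = 100 * missing_cells / n_observations   # share within the ISBN column

print(f"Missing cells (%): {missing_pct:.1f}%")
print(f"ISBN missing (%):  {isbn_missing_pct:.1f}%")
```

The two results differ only in the denominator: all cells of the table versus the rows of a single column.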

Variable types

Numeric: 2
Categorical: 5
Text: 1

Dataset

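Description: Holdings list (2019-2021) of specialist books on gambling (youth, soldiers, adults in general, families of gambling addicts, etc.) or on the gambling industry, and of publications issued by 한국도박문제예방치유원
Author: 한국도박문제관리센터
URL: https://www.data.go.kr/data/15103644/fileData.do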

Alerts

발행처 (publisher) is highly overall correlated with 분류 (category) and 1 other field [High correlation]
Vol is highly overall correlated with ISBN and 1 other field [High correlation]
저자 (author) is highly overall correlated with 분류 and 2 other fields [High correlation]
번호 (number) is highly overall correlated with 발행년 (publication year) [High correlation]
ISBN is highly overall correlated with Vol [High correlation]
분류 is highly overall correlated with 저자 and 1 other field [High correlation]
발행년 is highly overall correlated with 번호 [High correlation]
저자 is highly imbalanced (52.2%) [Imbalance]
발행처 is highly imbalanced (53.2%) [Imbalance]
ISBN has 28 (31.1%) missing values [Missing]
번호 has unique values [Unique]
자료명 (title) has unique values [Unique]
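
The two imbalance percentages can be reproduced from the category counts given further down in this report. The sketch below assumes the score is one minus the Shannon entropy of the class shares divided by its maximum, log2(k). That formula is an assumption chosen because it matches the reported figures, not something stated in the report; `imbalance_score` is a hypothetical helper name.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k), with H the Shannon entropy of the class shares."""
    n = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    entropy = -sum((c / n) * math.log2(c / n) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)


# Counts read from the variable sections: listed values plus the singleton values.
publisher_counts = [63, 8, 4] + [1] * 15   # 발행처: 18 distinct values, 15 unique
author_counts = [63, 4] + [1] * 23         # 저자: 25 distinct values, 23 unique

print(f"발행처: {100 * imbalance_score(publisher_counts):.1f}%")
print(f"저자: {100 * imbalance_score(author_counts):.1f}%")
```

A score of 0% would mean all categories are equally frequent; values near 100% mean a single category dominates.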

Reproduction

Analysis started: 2023-12-12 12:43:59.344584
Analysis finished: 2023-12-12 12:44:00.438735
Duration: 1.09 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
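
The reported duration is simply the difference between the two timestamps. A small check using only the standard library:

```python
from datetime import datetime

# Timestamps copied from the Reproduction block.
started = datetime.fromisoformat("2023-12-12 12:43:59.344584")
finished = datetime.fromisoformat("2023-12-12 12:44:00.438735")

duration = (finished - started).total_seconds()
print(f"Duration: {duration:.2f} seconds")
```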

Variables

번호
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 90
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 45.5
Minimum: 1
Maximum: 90
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 942.0 B

Quantile statistics

Minimum: 1
5th percentile: 5.45
Q1: 23.25
Median: 45.5
Q3: 67.75
95th percentile: 85.55
Maximum: 90
Range: 89
Interquartile range (IQR): 44.5

Descriptive statistics

Standard deviation: 26.124701
Coefficient of variation (CV): 0.57416925
Kurtosis: -1.2
Mean: 45.5
Median Absolute Deviation (MAD): 22.5
Skewness: 0
Sum: 4095
Variance: 682.5
Monotonicity: Strictly increasing
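
These figures are what one would expect from a plain running index. The sketch below assumes 번호 is exactly the sequence 1..90 (consistent with 90 distinct values, minimum 1, maximum 90, and a strictly increasing order) and recomputes several of the statistics with the standard library. The variance uses the sample (n - 1) denominator, which is what matches the reported 682.5.

```python
import math
import statistics

# Assumption: the column is the integer sequence 1..90.
values = list(range(1, 91))

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)    # sample variance, n - 1 denominator
std = math.sqrt(variance)
cv = std / mean                           # coefficient of variation
median = statistics.median(values)
mad = statistics.median(abs(v - median) for v in values)  # median absolute deviation

print(total, mean, variance, round(std, 6), round(cv, 8), mad)
```

For a sequence 1..n the sample variance has the closed form n(n + 1)/12, which for n = 90 gives 682.5.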
Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
1 | 1 | 1.1%
69 | 1 | 1.1%
67 | 1 | 1.1%
66 | 1 | 1.1%
65 | 1 | 1.1%
64 | 1 | 1.1%
63 | 1 | 1.1%
62 | 1 | 1.1%
61 | 1 | 1.1%
60 | 1 | 1.1%
Other values (80) | 80 | 88.9%

Minimum 10 values

Value | Count | Frequency (%)
1 | 1 | 1.1%
2 | 1 | 1.1%
3 | 1 | 1.1%
4 | 1 | 1.1%
5 | 1 | 1.1%
6 | 1 | 1.1%
7 | 1 | 1.1%
8 | 1 | 1.1%
9 | 1 | 1.1%
10 | 1 | 1.1%

Maximum 10 values

Value | Count | Frequency (%)
90 | 1 | 1.1%
89 | 1 | 1.1%
88 | 1 | 1.1%
87 | 1 | 1.1%
86 | 1 | 1.1%
85 | 1 | 1.1%
84 | 1 | 1.1%
83 | 1 | 1.1%
82 | 1 | 1.1%
81 | 1 | 1.1%

분류
Categorical

HIGH CORRELATION 

Distinct: 5
Distinct (%): 5.6%
Missing: 0
Missing (%): 0.0%
Memory size: 852.0 B

Value | Count
도박중독 | 58
심리치료 | 12
청소년도박 | 10
사행산업 | 6
중독 | 4

Length

Max length: 5
Median length: 4
Mean length: 4.0222222
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 사행산업
2nd row: 사행산업
3rd row: 중독
4th row: 사행산업
5th row: 청소년도박

Common Values

Value | Count | Frequency (%)
도박중독 (gambling addiction) | 58 | 64.4%
심리치료 (psychotherapy) | 12 | 13.3%
청소년도박 (youth gambling) | 10 | 11.1%
사행산업 (gambling industry) | 6 | 6.7%
중독 (addiction) | 4 | 4.4%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
도박중독 | 58 | 64.4%
심리치료 | 12 | 13.3%
청소년도박 | 10 | 11.1%
사행산업 | 6 | 6.7%
중독 | 4 | 4.4%

발행년
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): 3.3%
Missing: 0
Missing (%): 0.0%
Memory size: 852.0 B

Value | Count
2020 | 43
2021 | 25
2019 | 22

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2020
2nd row: 2019
3rd row: 2019
4th row: 2019
5th row: 2019

Common Values

Value | Count | Frequency (%)
2020 | 43 | 47.8%
2021 | 25 | 27.8%
2019 | 22 | 24.4%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
2020 | 43 | 47.8%
2021 | 25 | 27.8%
2019 | 22 | 24.4%

자료명
Text

UNIQUE 

Distinct: 90
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 852.0 B

Length

Max length: 48
Median length: 34
Mean length: 22.477778
Min length: 5

Characters and Unicode

Total characters: 2023
Distinct characters: 290
Distinct categories: 11
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
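
As an illustration of those character properties, the sketch below counts Unicode general categories for one title taken from the Sample section of this report, using Python's standard `unicodedata` module. It is not necessarily how the profiling tool derives its tables; it only shows where category names such as Other Letter (`Lo`), Uppercase Letter (`Lu`), and Space Separator (`Zs`) come from.

```python
import unicodedata
from collections import Counter

# One 자료명 value copied from the Sample section.
title = "KCGP Issue & Research Brief (창간호)"

# Two-letter general category codes, e.g. Lo = Other Letter, Zs = Space Separator.
categories = Counter(unicodedata.category(ch) for ch in title)

for code, count in categories.most_common():
    print(code, count)
```

Hangul syllables fall under `Lo`, which is why Other Letter dominates the category table for this column.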

Unique

Unique: 90
Unique (%): 100.0%

Sample

1st row: 2019년 사행산업 관련 통계
2nd row: 사행산업정책과 미래기술II
3rd row: 장기추적 연구를 통해 바라본 대한민국 중독문제의 현주소
4th row: 2018년 사행산업 관련 통계
5th row: 경기도 학생 도박 실태 분석 및 예방 정책 방향 연구

Value | Count | Frequency (%)
도박문제 | 29 | 6.4%
[unreadable] | 13 | 2.9%
위한 | 10 | 2.2%
안내서 | 10 | 2.2%
청소년 | 9 | 2.0%
매뉴얼 | 9 | 2.0%
[unreadable] | 7 | 1.5%
2019 | 6 | 1.3%
예방강사 | 6 | 1.3%
도박중독 | 6 | 1.3%
Other values (246) | 347 | 76.8%

Most occurring characters

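Value | Count | Frequency (%)
(space) | 362 | 17.9%
[unreadable] | 63 | 3.1%
[unreadable] | 56 | 2.8%
[unreadable] | 47 | 2.3%
2 | 44 | 2.2%
[unreadable] | 41 | 2.0%
0 | 35 | 1.7%
[unreadable] | 34 | 1.7%
[unreadable] | 31 | 1.5%
1 | 24 | 1.2%
Other values (280) | 1286 | 63.6%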

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1387 | 68.6%
Space Separator | 362 | 17.9%
Decimal Number | 121 | 6.0%
Uppercase Letter | 54 | 2.7%
Lowercase Letter | 37 | 1.8%
Close Punctuation | 16 | 0.8%
Open Punctuation | 16 | 0.8%
Other Punctuation | 15 | 0.7%
Dash Punctuation | 13 | 0.6%
Math Symbol | 1 | < 0.1%

Most frequent character per category

Other Letter

Value | Count | Frequency (%)
[unreadable] | 63 | 4.5%
[unreadable] | 56 | 4.0%
[unreadable] | 47 | 3.4%
[unreadable] | 41 | 3.0%
[unreadable] | 34 | 2.5%
[unreadable] | 31 | 2.2%
[unreadable] | 22 | 1.6%
[unreadable] | 21 | 1.5%
[unreadable] | 20 | 1.4%
[unreadable] | 20 | 1.4%
Other values (230) | 1032 | 74.4%

Uppercase Letter

Value | Count | Frequency (%)
C | 9 | 16.7%
I | 7 | 13.0%
R | 5 | 9.3%
A | 4 | 7.4%
E | 4 | 7.4%
T | 4 | 7.4%
B | 4 | 7.4%
S | 3 | 5.6%
P | 2 | 3.7%
M | 2 | 3.7%
Other values (8) | 10 | 18.5%

Lowercase Letter

Value | Count | Frequency (%)
e | 8 | 21.6%
s | 6 | 16.2%
r | 4 | 10.8%
v | 2 | 5.4%
u | 2 | 5.4%
o | 2 | 5.4%
l | 2 | 5.4%
a | 2 | 5.4%
c | 2 | 5.4%
h | 2 | 5.4%
Other values (3) | 5 | 13.5%

Decimal Number

Value | Count | Frequency (%)
2 | 44 | 36.4%
0 | 35 | 28.9%
1 | 24 | 19.8%
9 | 11 | 9.1%
3 | 2 | 1.7%
4 | 2 | 1.7%
8 | 2 | 1.7%
5 | 1 | 0.8%

Other Punctuation

Value | Count | Frequency (%)
. | 6 | 40.0%
& | 4 | 26.7%
, | 2 | 13.3%
· | 2 | 13.3%
! | 1 | 6.7%

Space Separator

Value | Count | Frequency (%)
(space) | 362 | 100.0%

Close Punctuation

Value | Count | Frequency (%)
) | 16 | 100.0%

Open Punctuation

Value | Count | Frequency (%)
( | 16 | 100.0%

Dash Punctuation

Value | Count | Frequency (%)
- | 13 | 100.0%

Math Symbol

Value | Count | Frequency (%)
~ | 1 | 100.0%

Connector Punctuation

Value | Count | Frequency (%)
_ | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1387 | 68.6%
Common | 545 | 26.9%
Latin | 91 | 4.5%

Most frequent character per script

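Hangul

Value | Count | Frequency (%)
[unreadable] | 63 | 4.5%
[unreadable] | 56 | 4.0%
[unreadable] | 47 | 3.4%
[unreadable] | 41 | 3.0%
[unreadable] | 34 | 2.5%
[unreadable] | 31 | 2.2%
[unreadable] | 22 | 1.6%
[unreadable] | 21 | 1.5%
[unreadable] | 20 | 1.4%
[unreadable] | 20 | 1.4%
Other values (230) | 1032 | 74.4%

Latin

Value | Count | Frequency (%)
C | 9 | 9.9%
e | 8 | 8.8%
I | 7 | 7.7%
s | 6 | 6.6%
R | 5 | 5.5%
A | 4 | 4.4%
E | 4 | 4.4%
T | 4 | 4.4%
r | 4 | 4.4%
B | 4 | 4.4%
Other values (21) | 36 | 39.6%

Common

Value | Count | Frequency (%)
(space) | 362 | 66.4%
2 | 44 | 8.1%
0 | 35 | 6.4%
1 | 24 | 4.4%
) | 16 | 2.9%
( | 16 | 2.9%
- | 13 | 2.4%
9 | 11 | 2.0%
. | 6 | 1.1%
& | 4 | 0.7%
Other values (9) | 14 | 2.6%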

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1387 | 68.6%
ASCII | 634 | 31.3%
None | 2 | 0.1%

Most frequent character per block

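ASCII

Value | Count | Frequency (%)
(space) | 362 | 57.1%
2 | 44 | 6.9%
0 | 35 | 5.5%
1 | 24 | 3.8%
) | 16 | 2.5%
( | 16 | 2.5%
- | 13 | 2.1%
9 | 11 | 1.7%
C | 9 | 1.4%
e | 8 | 1.3%
Other values (39) | 96 | 15.1%

Hangul

Value | Count | Frequency (%)
[unreadable] | 63 | 4.5%
[unreadable] | 56 | 4.0%
[unreadable] | 47 | 3.4%
[unreadable] | 41 | 3.0%
[unreadable] | 34 | 2.5%
[unreadable] | 31 | 2.2%
[unreadable] | 22 | 1.6%
[unreadable] | 21 | 1.5%
[unreadable] | 20 | 1.4%
[unreadable] | 20 | 1.4%
Other values (230) | 1032 | 74.4%

None

Value | Count | Frequency (%)
· | 2 | 100.0%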

Vol
Categorical

HIGH CORRELATION 

Distinct: 3
Distinct (%): 3.3%
Missing: 0
Missing (%): 0.0%
Memory size: 852.0 B

Value | Count
<NA> | 74
1 | 11
2 | 5

Length

Max length: 4
Median length: 4
Mean length: 3.4666667
Min length: 1
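
Vol reports no missing values even though 74 of its 90 rows read `<NA>`. The length statistics are consistent with `<NA>` having been counted as a literal four-character string rather than as a missing value. The sketch below checks that reading against the reported mean length; it is an inference from the numbers, not a statement about how the source file encodes missing volumes.

```python
# Category counts copied from the Vol summary; "<NA>" is treated as a plain string.
counts = {"<NA>": 74, "1": 11, "2": 5}

n_rows = sum(counts.values())
mean_length = sum(len(value) * count for value, count in counts.items()) / n_rows

print(f"Mean length: {mean_length:.7f}")
```

If the 74 `<NA>` entries were excluded instead, every remaining value would have length 1, which would not match the reported maximum and median length of 4.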

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: <NA>
4th row: <NA>
5th row: <NA>

Common Values

Value | Count | Frequency (%)
<NA> | 74 | 82.2%
1 | 11 | 12.2%
2 | 5 | 5.6%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
na | 74 | 82.2%
1 | 11 | 12.2%
2 | 5 | 5.6%

저자
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 25
Distinct (%): 27.8%
Missing: 0
Missing (%): 0.0%
Memory size: 852.0 B

Value | Count
한국도박문제관리센터 | 63
사행산업통합감독위원회 | 4
이현수 (지은이) | 1
중독포럼, 국립정신건강센터 | 1
이근영, 이주영, 최지현, 황병록 | 1
Other values (20) | 20

Length

Max length: 47
Median length: 10
Mean length: 13.177778
Min length: 3

Unique

Unique: 23
Unique (%): 25.6%

Sample

1st row: 사행산업통합감독위원회
2nd row: 권영실 외 다수
3rd row: 중독포럼, 국립정신건강센터
4th row: 사행산업통합감독위원회
5th row: 이근영, 이주영, 최지현, 황병록

Common Values

Value | Count | Frequency (%)
한국도박문제관리센터 | 63 | 70.0%
사행산업통합감독위원회 | 4 | 4.4%
이현수 (지은이) | 1 | 1.1%
중독포럼, 국립정신건강센터 | 1 | 1.1%
이근영, 이주영, 최지현, 황병록 | 1 | 1.1%
Robert L. Leahy | 1 | 1.1%
채연숙, 김성범 외 3명 | 1 | 1.1%
Alan F. Friedman (지은이), 유성진 (옮긴이) | 1 | 1.1%
노상헌 | 1 | 1.1%
David P. Celani (지은이), 김영호, 김미란, 오남경, 김순천 (옮긴이) | 1 | 1.1%
Other values (15) | 15 | 16.7%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
한국도박문제관리센터 | 63 | 37.7%
지은이 | 16 | 9.6%
옮긴이 | 9 | 5.4%
사행산업통합감독위원회 | 4 | 2.4%
[unreadable] | 2 | 1.2%
강경이 | 1 | 0.6%
이승현 | 1 | 0.6%
김은하 | 1 | 0.6%
erford | 1 | 0.6%
t | 1 | 0.6%
Other values (68) | 68 | 40.7%

발행처
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 18
Distinct (%): 20.0%
Missing: 0
Missing (%): 0.0%
Memory size: 852.0 B

Value | Count
한국도박문제관리센터 | 63
학지사 | 8
사행산업통합감독위원회 | 4
블루페가수스 | 1
중독포럼, 국립정신건강센터 | 1
Other values (13) | 13

Length

Max length: 14
Median length: 10
Mean length: 8.8222222
Min length: 3

Unique

Unique: 15
Unique (%): 16.7%

Sample

1st row: 사행산업통합감독위원회
2nd row: 복권학회
3rd row: 중독포럼, 국립정신건강센터
4th row: 사행산업통합감독위원회
5th row: 경기도교육연구원

Common Values

Value | Count | Frequency (%)
한국도박문제관리센터 | 63 | 70.0%
학지사 | 8 | 8.9%
사행산업통합감독위원회 | 4 | 4.4%
블루페가수스 | 1 | 1.1%
중독포럼, 국립정신건강센터 | 1 | 1.1%
경기도교육연구원 | 1 | 1.1%
교육과학사 | 1 | 1.1%
홍성사 | 1 | 1.1%
한국가족복지연구소 | 1 | 1.1%
복권학회 | 1 | 1.1%
Other values (8) | 8 | 8.9%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
한국도박문제관리센터 | 63 | 69.2%
학지사 | 8 | 8.8%
사행산업통합감독위원회 | 4 | 4.4%
한국학술정보 | 1 | 1.1%
연지출판사 | 1 | 1.1%
가을밤 | 1 | 1.1%
한국형사정책연구원 | 1 | 1.1%
정민사 | 1 | 1.1%
다산초당(다산북스 | 1 | 1.1%
시그마프레스 | 1 | 1.1%
Other values (9) | 9 | 9.9%

ISBN
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct: 58
Distinct (%): 93.5%
Missing: 28
Missing (%): 31.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 8.3062905 × 10^12
Minimum: 23843691
Maximum: 9.7911973 × 10^12
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 942.0 B

Quantile statistics

Minimum: 23843691
5th percentile: 4.8981907 × 10^10
Q1: 9.7889997 × 10^12
Median: 9.7911868 × 10^12
Q3: 9.7911868 × 10^12
95th percentile: 9.7911898 × 10^12
Maximum: 9.7911973 × 10^12
Range: 9.7911734 × 10^12
Interquartile range (IQR): 2.187096 × 10^9

Descriptive statistics

Standard deviation: 3.4180758 × 10^12
Coefficient of variation (CV): 0.41150448
Kurtosis: 1.6950468
Mean: 8.3062905 × 10^12
Median Absolute Deviation (MAD): 912296.5
Skewness: -1.9014611
Sum: 5.1499001 × 10^14
Variance: 1.1683242 × 10^25
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)

Common values

Value | Count | Frequency (%)
27339947 | 3 | 3.3%
9791186815878 | 2 | 2.2%
9791186815557 | 2 | 2.2%
9791130629353 | 1 | 1.1%
9791186815793 | 1 | 1.1%
23843691 | 1 | 1.1%
9788989821168 | 1 | 1.1%
9791189830083 | 1 | 1.1%
9788999717673 | 1 | 1.1%
9788999724138 | 1 | 1.1%
Other values (48) | 48 | 53.3%
(Missing) | 28 | 31.1%

Minimum 10 values

Value | Count | Frequency (%)
23843691 | 1 | 1.1%
27339947 | 3 | 3.3%
979118681545 | 1 | 1.1%
979118681557 | 1 | 1.1%
979118681571 | 1 | 1.1%
979118681572 | 1 | 1.1%
979118681573 | 1 | 1.1%
979118681574 | 1 | 1.1%
9788925414850 | 1 | 1.1%
9788926895504 | 1 | 1.1%

Maximum 10 values

Value | Count | Frequency (%)
9791197288913 | 1 | 1.1%
9791192130002 | 1 | 1.1%
9791189908324 | 1 | 1.1%
9791189831523 | 1 | 1.1%
9791189830083 | 1 | 1.1%
9791188580057 | 1 | 1.1%
9791186815991 | 1 | 1.1%
9791186815984 | 1 | 1.1%
9791186815977 | 1 | 1.1%
9791186815960 | 1 | 1.1%
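
Because ISBN was profiled as a real number, the tables above mix 13-digit, 12-digit, and 8-digit values without saying what they are. A hedged sketch of a check-digit classifier follows. It uses the standard ISBN-13 rule (alternating weights 1 and 3, total divisible by 10) and the standard ISSN rule (weights 8 down to 2, modulo 11). Treating the 8-digit values as ISSNs is an inference from their check digits, not something the report states, and the function names are illustrative.

```python
def isbn13_is_valid(digits: str) -> bool:
    """True if the 13-digit string satisfies the ISBN-13 check digit."""
    if len(digits) != 13 or not digits.isdigit():
        return False
    total = sum(int(d) * (1 if i % 2 == 0 else 3) for i, d in enumerate(digits))
    return total % 10 == 0


def issn_is_valid(digits: str) -> bool:
    """True if the 8-character string satisfies the ISSN check digit."""
    if len(digits) != 8 or not digits[:7].isdigit():
        return False
    total = sum(int(d) * w for d, w in zip(digits[:7], range(8, 1, -1)))
    check = (11 - total % 11) % 11
    expected = "X" if check == 10 else str(check)
    return digits[7].upper() == expected


def classify(value: int) -> str:
    """Label a numeric identifier from the ISBN column by its check digit."""
    text = str(value)
    if isbn13_is_valid(text):
        return "ISBN-13"
    if issn_is_valid(text):
        return "ISSN"
    return "unrecognized"


# Values copied from the frequency tables above.
for value in (9791186815878, 27339947, 23843691, 979118681557):
    print(value, classify(value))
```

Under these rules the two 8-digit values pass the ISSN check, while the 12-digit values pass neither check, which suggests they may be worth reviewing in the source file. Summary statistics such as the mean or variance carry no meaning for identifiers of this kind.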

Interactions

[Four interaction plots; images not recoverable]

Correlations

 | 번호 | 분류 | 발행년 | 자료명 | Vol | 저자 | 발행처 | ISBN
번호 | 1.000 | 0.566 | 0.798 | 1.000 | 0.000 | 0.423 | 0.450 | 0.448
분류 | 0.566 | 1.000 | 0.000 | 1.000 | 0.719 | 0.982 | 0.949 | 0.000
발행년 | 0.798 | 0.000 | 1.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.124
자료명 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000 | 1.000
Vol | 0.000 | 0.719 | 0.000 | 1.000 | 1.000 | 1.000 | 0.614 | NaN
저자 | 0.423 | 0.982 | 0.000 | 1.000 | 1.000 | 1.000 | 1.000 | 0.000
발행처 | 0.450 | 0.949 | 0.000 | 1.000 | 0.614 | 1.000 | 1.000 | 0.000
ISBN | 0.448 | 0.000 | 0.124 | 1.000 | NaN | 0.000 | 0.000 | 1.000

 | 발행처 | 분류 | Vol | 발행년 | 저자
발행처 | 1.000 | 0.781 | 0.312 | 0.000 | 0.950
분류 | 0.781 | 1.000 | 0.465 | 0.000 | 0.726
Vol | 0.312 | 0.465 | 1.000 | 0.000 | 1.000
발행년 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000
저자 | 0.950 | 0.726 | 1.000 | 0.000 | 1.000

 | 번호 | ISBN | 분류 | 발행년 | Vol | 저자 | 발행처
번호 | 1.000 | -0.062 | 0.259 | 0.660 | 0.000 | 0.135 | 0.174
ISBN | -0.062 | 1.000 | 0.000 | 0.209 | 1.000 | 0.000 | 0.000
분류 | 0.259 | 0.000 | 1.000 | 0.000 | 0.465 | 0.726 | 0.781
발행년 | 0.660 | 0.209 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000
Vol | 0.000 | 1.000 | 0.465 | 0.000 | 1.000 | 1.000 | 0.312
저자 | 0.135 | 0.000 | 0.726 | 0.000 | 1.000 | 1.000 | 0.950
발행처 | 0.174 | 0.000 | 0.781 | 0.000 | 0.312 | 0.950 | 1.000
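
The High correlation alerts near the top of this report line up with the last correlation matrix above if every off-diagonal pair with an absolute value of at least 0.5 is flagged. The sketch below applies that rule. The 0.5 cut-off is an assumption that happens to reproduce the alerts, and `highly_correlated` is a hypothetical helper, not part of any library.

```python
# Last correlation matrix, copied row by row.
columns = ["번호", "ISBN", "분류", "발행년", "Vol", "저자", "발행처"]
matrix = [
    [1.000, -0.062, 0.259, 0.660, 0.000, 0.135, 0.174],
    [-0.062, 1.000, 0.000, 0.209, 1.000, 0.000, 0.000],
    [0.259, 0.000, 1.000, 0.000, 0.465, 0.726, 0.781],
    [0.660, 0.209, 0.000, 1.000, 0.000, 0.000, 0.000],
    [0.000, 1.000, 0.465, 0.000, 1.000, 1.000, 0.312],
    [0.135, 0.000, 0.726, 0.000, 1.000, 1.000, 0.950],
    [0.174, 0.000, 0.781, 0.000, 0.312, 0.950, 1.000],
]
THRESHOLD = 0.5  # assumed cut-off, not stated in the report


def highly_correlated(columns, matrix, threshold):
    """Map each column to the other columns whose |correlation| meets the threshold."""
    flagged = {}
    for i, name in enumerate(columns):
        partners = [
            columns[j]
            for j, value in enumerate(matrix[i])
            if j != i and abs(value) >= threshold
        ]
        if partners:
            flagged[name] = partners
    return flagged


flagged = highly_correlated(columns, matrix, THRESHOLD)
for name, partners in flagged.items():
    print(name, "->", ", ".join(partners))
```

With this cut-off 저자 is paired with 분류, Vol, and 발행처, matching the alert that lists 분류 and 2 other fields.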

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

(index) | 번호 | 분류 | 발행년 | 자료명 | Vol | 저자 | 발행처 | ISBN
0 | 1 | 사행산업 | 2020 | 2019년 사행산업 관련 통계 | <NA> | 사행산업통합감독위원회 | 사행산업통합감독위원회 | <NA>
1 | 2 | 사행산업 | 2019 | 사행산업정책과 미래기술II | <NA> | 권영실 외 다수 | 복권학회 | 9791188580057
2 | 3 | 중독 | 2019 | 장기추적 연구를 통해 바라본 대한민국 중독문제의 현주소 | <NA> | 중독포럼, 국립정신건강센터 | 중독포럼, 국립정신건강센터 | <NA>
3 | 4 | 사행산업 | 2019 | 2018년 사행산업 관련 통계 | <NA> | 사행산업통합감독위원회 | 사행산업통합감독위원회 | <NA>
4 | 5 | 청소년도박 | 2019 | 경기도 학생 도박 실태 분석 및 예방 정책 방향 연구 | <NA> | 이근영, 이주영, 최지현, 황병록 | 경기도교육연구원 | 9791189831523
5 | 6 | 도박중독 | 2020 | 도박중독 사례관리 매뉴얼(2020) | <NA> | 한국도박문제관리센터 | 한국도박문제관리센터 | 9791186815878
6 | 7 | 도박중독 | 2020 | KCGP Issue & Research Brief (창간호) | <NA> | 한국도박문제관리센터 | 한국도박문제관리센터 | 27339947
7 | 8 | 도박중독 | 2020 | 사례로 풀어보는 재정법률 이야기 (개정판) | <NA> | 한국도박문제관리센터 | 한국도박문제관리센터 | 9791186815830
8 | 9 | 도박중독 | 2019 | 2019 도박문제 예방강사 활동 안내서 | <NA> | 한국도박문제관리센터 | 한국도박문제관리센터 | <NA>
9 | 10 | 도박중독 | 2019 | 2019 도박문제 예방인력 양성 매뉴얼 | <NA> | 한국도박문제관리센터 | 한국도박문제관리센터 | <NA>

Last rows

(index) | 번호 | 분류 | 발행년 | 자료명 | Vol | 저자 | 발행처 | ISBN
80 | 81 | 도박중독 | 2021 | 코로나19와 도박문제 요약보고서 | <NA> | 한국도박문제관리센터 | 한국도박문제관리센터 | 9791186815588
81 | 82 | 도박중독 | 2021 | ISSUE & RESEARCH BRIEF(2021. 12.) vol. 3 | <NA> | 한국도박문제관리센터 | 한국도박문제관리센터 | 27339947
82 | 83 | 청소년도박 | 2021 | 초등대상 도박 예방교육 교육연극 수업지도안 | <NA> | 한국도박문제관리센터 | 한국도박문제관리센터 | 979118681557
83 | 84 | 도박중독 | 2021 | 한국도박문제관리센터의 서비스 조직 및 서비스 전달체계 개편방안 연구 | <NA> | 한국도박문제관리센터 | 한국도박문제관리센터 | 9791186815557
84 | 85 | 도박중독 | 2021 | 한국도박문제관리센터의 서비스 조직 및 서비스 전달체계 개편방안 연구(요약보고서) | <NA> | 한국도박문제관리센터 | 한국도박문제관리센터 | 9791186815557
85 | 86 | 도박중독 | 2021 | 도박문제 예방인력 양성 매뉴얼 | <NA> | 한국도박문제관리센터 | 한국도박문제관리센터 | <NA>
86 | 87 | 도박중독 | 2021 | 도박문제 예방강사 활동 안내서 | <NA> | 한국도박문제관리센터 | 한국도박문제관리센터 | <NA>
87 | 88 | 사행산업 | 2021 | 사행산업의 온라인 발매제 도입에 따른 전망과 대책 연구 | <NA> | 한국도박문제관리센터 | 한국도박문제관리센터 | 9791186815977
88 | 89 | 도박중독 | 2021 | 한국도박문제관리센터 도박중독 재활사업 평가지표 개발 | <NA> | 한국도박문제관리센터 | 한국도박문제관리센터 | <NA>
89 | 90 | 도박중독 | 2021 | 2021년 강원랜드 인근지역 도박관련 체류자 실태조사 | <NA> | 한국도박문제관리센터 | 한국도박문제관리센터 | 9791192130002