Overview

Dataset statistics

Number of variables: 5
Number of observations: 80
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 3.3 KiB
Average record size in memory: 42.6 B

Variable types

Numeric: 1
Categorical: 2
Text: 2

Dataset

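Description: Data on the serial publications (print magazines, electronic magazines, print newspapers, and electronic newspapers) to which the Gyeongnam Representative Library (경남대표도서관) subscribes. It provides the publication type, publication title, publisher, and publication frequency, among other fields.
URL: https://www.data.go.kr/data/15103347/fileData.do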

Alerts

연번 is highly overall correlated with 구분 (High correlation)
구분 is highly overall correlated with 연번 and 1 other field (High correlation)
간기 is highly overall correlated with 구분 (High correlation)
연번 has unique values (Unique)

Reproduction

Analysis started: 2023-12-12 04:12:50.793613
Analysis finished: 2023-12-12 04:12:51.512568
Duration: 0.72 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
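
The reported duration follows from the two timestamps above. A minimal Python sketch of that arithmetic, using only the standard library; the variable names are illustrative and not part of the report:

```python
from datetime import datetime

# Timestamps copied from the Reproduction block above.
started = datetime.fromisoformat("2023-12-12 04:12:50.793613")
finished = datetime.fromisoformat("2023-12-12 04:12:51.512568")

# Elapsed wall-clock time in seconds, rounded to two decimals as in the report.
duration_seconds = round((finished - started).total_seconds(), 2)
print(duration_seconds)
```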

Variables

연번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 80
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 40.5
Minimum: 1
Maximum: 80
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 852.0 B

Quantile statistics

Minimum: 1
5-th percentile: 4.95
Q1: 20.75
Median: 40.5
Q3: 60.25
95-th percentile: 76.05
Maximum: 80
Range: 79
Interquartile range (IQR): 39.5
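
These quantiles are consistent with linear interpolation between the smallest and largest value. The sketch below checks this with Python's standard library. It assumes that 연번 is the running number 1 to 80 (the report only states 80 distinct values with minimum 1 and maximum 80), and `values` is a stand-in for the real column:

```python
import statistics

# Assumption: 연번 holds the integers 1..80 (80 distinct values, min 1, max 80).
values = list(range(1, 81))

# method="inclusive" interpolates linearly between the sample minimum and maximum.
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
twentieths = statistics.quantiles(values, n=20, method="inclusive")
p5, p95 = twentieths[0], twentieths[-1]

iqr = q3 - q1
value_range = max(values) - min(values)
print(p5, q1, median, q3, p95)
print("IQR:", iqr, "Range:", value_range)
```

Under that assumption the sketch gives Q1 = 20.75, median = 40.5, and Q3 = 60.25, so the IQR is 39.5, matching the figures above.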

Descriptive statistics

Standard deviation: 23.2379
Coefficient of variation (CV): 0.57377531
Kurtosis: -1.2
Mean: 40.5
Median Absolute Deviation (MAD): 20
Skewness: 0
Sum: 3240
Variance: 540
Monotonicity: Strictly increasing
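
Most of the descriptive statistics above can be reproduced by hand. The sketch below assumes that 연번 is the sequence 1 to 80 and uses the sample variance (dividing by n − 1), which is the convention that yields the reported 540. The variable names are illustrative:

```python
import statistics

# Assumption: 연번 is the running number 1..80.
values = list(range(1, 81))

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance, divides by n - 1
std = statistics.stdev(values)           # square root of the sample variance
cv = std / mean                          # coefficient of variation
med = statistics.median(values)
# Median absolute deviation: median of |x - median(x)|.
mad = statistics.median(abs(v - med) for v in values)

print(total, mean, variance, round(std, 4), round(cv, 8), mad)
```

For the integers 1 to n the sample variance is n(n + 1)/12, so n = 80 gives 80 · 81 / 12 = 540.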
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
1 | 1 | 1.2%
42 | 1 | 1.2%
60 | 1 | 1.2%
59 | 1 | 1.2%
58 | 1 | 1.2%
57 | 1 | 1.2%
56 | 1 | 1.2%
55 | 1 | 1.2%
54 | 1 | 1.2%
53 | 1 | 1.2%
Other values (70) | 70 | 87.5%

Value | Count | Frequency (%)
1 | 1 | 1.2%
2 | 1 | 1.2%
3 | 1 | 1.2%
4 | 1 | 1.2%
5 | 1 | 1.2%
6 | 1 | 1.2%
7 | 1 | 1.2%
8 | 1 | 1.2%
9 | 1 | 1.2%
10 | 1 | 1.2%

Value | Count | Frequency (%)
80 | 1 | 1.2%
79 | 1 | 1.2%
78 | 1 | 1.2%
77 | 1 | 1.2%
76 | 1 | 1.2%
75 | 1 | 1.2%
74 | 1 | 1.2%
73 | 1 | 1.2%
72 | 1 | 1.2%
71 | 1 | 1.2%

구분
Categorical

HIGH CORRELATION 

Distinct: 4
Distinct (%): 5.0%
Missing: 0
Missing (%): 0.0%
Memory size: 772.0 B
종이잡지: 40
종이신문: 20
전자잡지: 10
전자신문: 10

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 종이잡지
2nd row: 종이잡지
3rd row: 종이잡지
4th row: 종이잡지
5th row: 종이잡지

Common Values

Value | Count | Frequency (%)
종이잡지 | 40 | 50.0%
종이신문 | 20 | 25.0%
전자잡지 | 10 | 12.5%
전자신문 | 10 | 12.5%

Length

Histogram of lengths of the category

Common Values (Plot)

간행물명
Text

Distinct: 79
Distinct (%): 98.8%
Missing: 0
Missing (%): 0.0%
Memory size: 772.0 B

Length

Max length: 16
Median length: 15
Mean length: 5.675
Min length: 2

Characters and Unicode

Total characters: 454
Distinct characters: 159
Distinct categories: 6
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 78
Unique (%): 97.5%

Sample

1st row: BBC 사이언스
2nd row: 건강다이제스트
3rd row: 골프다이제스트
4th row: 과학동아
5th row: 더그아웃
Value | Count | Frequency (%)
월간 | 8 | 7.5%
times | 3 | 2.8%
the | 3 | 2.8%
경남도민일보 | 2 | 1.9%
과학동아 | 2 | 1.9%
어린이 | 2 | 1.9%
수학동아 | 2 | 1.9%
한겨레신문 | 1 | 0.9%
중앙일보 | 1 | 0.9%
조선일보 | 1 | 0.9%
Other values (82) | 82 | 76.6%

Most occurring characters

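Value | Count | Frequency (%)
(space) | 28 | 6.2%
(not preserved) | 17 | 3.7%
(not preserved) | 16 | 3.5%
(not preserved) | 16 | 3.5%
(not preserved) | 14 | 3.1%
(not preserved) | 13 | 2.9%
e | 11 | 2.4%
(not preserved) | 10 | 2.2%
(not preserved) | 9 | 2.0%
(not preserved) | 9 | 2.0%
Other values (149) | 311 | 68.5%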

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 352 | 77.5%
Lowercase Letter | 46 | 10.1%
Space Separator | 28 | 6.2%
Uppercase Letter | 21 | 4.6%
Decimal Number | 6 | 1.3%
Other Punctuation | 1 | 0.2%

Most frequent character per category

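Other Letter
Value | Count | Frequency (%)
(not preserved) | 17 | 4.8%
(not preserved) | 16 | 4.5%
(not preserved) | 16 | 4.5%
(not preserved) | 14 | 4.0%
(not preserved) | 13 | 3.7%
(not preserved) | 10 | 2.8%
(not preserved) | 9 | 2.6%
(not preserved) | 9 | 2.6%
(not preserved) | 9 | 2.6%
(not preserved) | 8 | 2.3%
Other values (119) | 231 | 65.6%

Lowercase Letter
Value | Count | Frequency (%)
e | 11 | 23.9%
i | 7 | 15.2%
s | 4 | 8.7%
h | 4 | 8.7%
m | 3 | 6.5%
g | 3 | 6.5%
n | 3 | 6.5%
r | 2 | 4.3%
o | 2 | 4.3%
a | 2 | 4.3%
Other values (4) | 5 | 10.9%

Uppercase Letter
Value | Count | Frequency (%)
T | 7 | 33.3%
K | 2 | 9.5%
B | 2 | 9.5%
I | 2 | 9.5%
V | 1 | 4.8%
G | 1 | 4.8%
C | 1 | 4.8%
Q | 1 | 4.8%
S | 1 | 4.8%
N | 1 | 4.8%
Other values (2) | 2 | 9.5%

Decimal Number
Value | Count | Frequency (%)
1 | 3 | 50.0%
2 | 3 | 50.0%

Space Separator
Value | Count | Frequency (%)
(space) | 28 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
& | 1 | 100.0%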

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 352 | 77.5%
Latin | 67 | 14.8%
Common | 35 | 7.7%

Most frequent character per script

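Hangul
Value | Count | Frequency (%)
(not preserved) | 17 | 4.8%
(not preserved) | 16 | 4.5%
(not preserved) | 16 | 4.5%
(not preserved) | 14 | 4.0%
(not preserved) | 13 | 3.7%
(not preserved) | 10 | 2.8%
(not preserved) | 9 | 2.6%
(not preserved) | 9 | 2.6%
(not preserved) | 9 | 2.6%
(not preserved) | 8 | 2.3%
Other values (119) | 231 | 65.6%

Latin
Value | Count | Frequency (%)
e | 11 | 16.4%
i | 7 | 10.4%
T | 7 | 10.4%
s | 4 | 6.0%
h | 4 | 6.0%
m | 3 | 4.5%
g | 3 | 4.5%
n | 3 | 4.5%
K | 2 | 3.0%
r | 2 | 3.0%
Other values (16) | 21 | 31.3%

Common
Value | Count | Frequency (%)
(space) | 28 | 80.0%
1 | 3 | 8.6%
2 | 3 | 8.6%
& | 1 | 2.9%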

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 352 | 77.5%
ASCII | 102 | 22.5%

Most frequent character per block

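ASCII
Value | Count | Frequency (%)
(space) | 28 | 27.5%
e | 11 | 10.8%
i | 7 | 6.9%
T | 7 | 6.9%
s | 4 | 3.9%
h | 4 | 3.9%
1 | 3 | 2.9%
2 | 3 | 2.9%
m | 3 | 2.9%
g | 3 | 2.9%
Other values (20) | 29 | 28.4%

Hangul
Value | Count | Frequency (%)
(not preserved) | 17 | 4.8%
(not preserved) | 16 | 4.5%
(not preserved) | 16 | 4.5%
(not preserved) | 14 | 4.0%
(not preserved) | 13 | 3.7%
(not preserved) | 10 | 2.8%
(not preserved) | 9 | 2.6%
(not preserved) | 9 | 2.6%
(not preserved) | 9 | 2.6%
(not preserved) | 8 | 2.3%
Other values (119) | 231 | 65.6%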

발행사
Text

Distinct: 67
Distinct (%): 83.8%
Missing: 0
Missing (%): 0.0%
Memory size: 772.0 B

Length

Max length: 12
Median length: 10.5
Mean length: 6.1875
Min length: 2

Characters and Unicode

Total characters: 495
Distinct characters: 141
Distinct categories: 7
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 57
Unique (%): 71.2%

Sample

1st row: BBC사이언스 편집부
2nd row: 건강다이제스트사
3rd row: 골프다이제스트
4th row: 동아사이언스
5th row: 대단한미디어
Value | Count | Frequency (%)
주식회사 | 5 | 5.6%
동아사이언스 | 4 | 4.4%
동아일보사 | 3 | 3.3%
오리콤 | 2 | 2.2%
경남도민일보사 | 2 | 2.2%
교원 | 2 | 2.2%
두산매거진 | 2 | 2.2%
헤럴드 | 2 | 2.2%
매일경제신문사 | 2 | 2.2%
타임즈코어 | 2 | 2.2%
Other values (62) | 64 | 71.1%

Most occurring characters

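Value | Count | Frequency (%)
(not preserved) | 42 | 8.5%
(not preserved) | 21 | 4.2%
(not preserved) | 20 | 4.0%
(not preserved) | 18 | 3.6%
(not preserved) | 15 | 3.0%
(not preserved) | 15 | 3.0%
(not preserved) | 15 | 3.0%
(not preserved) | 15 | 3.0%
(not preserved) | 14 | 2.8%
(not preserved) | 10 | 2.0%
Other values (131) | 310 | 62.6%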

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 465 | 93.9%
Space Separator | 10 | 2.0%
Open Punctuation | 5 | 1.0%
Other Symbol | 5 | 1.0%
Close Punctuation | 5 | 1.0%
Uppercase Letter | 3 | 0.6%
Decimal Number | 2 | 0.4%

Most frequent character per category

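Other Letter
Value | Count | Frequency (%)
(not preserved) | 42 | 9.0%
(not preserved) | 21 | 4.5%
(not preserved) | 20 | 4.3%
(not preserved) | 18 | 3.9%
(not preserved) | 15 | 3.2%
(not preserved) | 15 | 3.2%
(not preserved) | 15 | 3.2%
(not preserved) | 15 | 3.2%
(not preserved) | 14 | 3.0%
(not preserved) | 10 | 2.2%
Other values (123) | 280 | 60.2%

Uppercase Letter
Value | Count | Frequency (%)
B | 2 | 66.7%
C | 1 | 33.3%

Decimal Number
Value | Count | Frequency (%)
1 | 1 | 50.0%
2 | 1 | 50.0%

Space Separator
Value | Count | Frequency (%)
(space) | 10 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 5 | 100.0%

Other Symbol
Value | Count | Frequency (%)
(not preserved) | 5 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 5 | 100.0%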

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 470 | 94.9%
Common | 22 | 4.4%
Latin | 3 | 0.6%

Most frequent character per script

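Hangul
Value | Count | Frequency (%)
(not preserved) | 42 | 8.9%
(not preserved) | 21 | 4.5%
(not preserved) | 20 | 4.3%
(not preserved) | 18 | 3.8%
(not preserved) | 15 | 3.2%
(not preserved) | 15 | 3.2%
(not preserved) | 15 | 3.2%
(not preserved) | 15 | 3.2%
(not preserved) | 14 | 3.0%
(not preserved) | 10 | 2.1%
Other values (124) | 285 | 60.6%

Common
Value | Count | Frequency (%)
(space) | 10 | 45.5%
( | 5 | 22.7%
) | 5 | 22.7%
1 | 1 | 4.5%
2 | 1 | 4.5%

Latin
Value | Count | Frequency (%)
B | 2 | 66.7%
C | 1 | 33.3%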

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 465 | 93.9%
ASCII | 25 | 5.1%
None | 5 | 1.0%

Most frequent character per block

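Hangul
Value | Count | Frequency (%)
(not preserved) | 42 | 9.0%
(not preserved) | 21 | 4.5%
(not preserved) | 20 | 4.3%
(not preserved) | 18 | 3.9%
(not preserved) | 15 | 3.2%
(not preserved) | 15 | 3.2%
(not preserved) | 15 | 3.2%
(not preserved) | 15 | 3.2%
(not preserved) | 14 | 3.0%
(not preserved) | 10 | 2.2%
Other values (123) | 280 | 60.2%

ASCII
Value | Count | Frequency (%)
(space) | 10 | 40.0%
( | 5 | 20.0%
) | 5 | 20.0%
B | 2 | 8.0%
C | 1 | 4.0%
1 | 1 | 4.0%
2 | 1 | 4.0%

None
Value | Count | Frequency (%)
(not preserved) | 5 | 100.0%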

간기
Categorical

HIGH CORRELATION 

Distinct: 8
Distinct (%): 10.0%
Missing: 0
Missing (%): 0.0%
Memory size: 772.0 B
월간: 35
일간: 24
주간: 11
격주간: 4
계간: 2
Other values (3): 4

Length

Max length: 3
Median length: 2
Mean length: 2.0875
Min length: 2

Unique

Unique: 2
Unique (%): 2.5%

Sample

1st row: 월간
2nd row: 월간
3rd row: 월간
4th row: 월간
5th row: 월간

Common Values

Value | Count | Frequency (%)
월간 | 35 | 43.8%
일간 | 24 | 30.0%
주간 | 11 | 13.8%
격주간 | 4 | 5.0%
계간 | 2 | 2.5%
격월간 | 2 | 2.5%
원간 | 1 | 1.2%
격일간 | 1 | 1.2%
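
The Length and Unique figures reported for 간기 can be derived from this frequency table alone. A minimal sketch, with the counts copied from the table. It assumes lengths are measured in characters (Unicode code points), which is an assumption about how the profiler counts them, and the variable names are illustrative:

```python
import statistics

# Counts copied from the Common Values table for 간기.
counts = {"월간": 35, "일간": 24, "주간": 11, "격주간": 4,
          "계간": 2, "격월간": 2, "원간": 1, "격일간": 1}

total = sum(counts.values())                       # number of observations
# One length per observation: each value's length repeated by its count.
lengths = [len(value) for value, n in counts.items() for _ in range(n)]

distinct = len(counts)                             # different values
unique = sum(1 for n in counts.values() if n == 1) # values that occur exactly once

mean_length = statistics.mean(lengths)
median_length = statistics.median(lengths)

print(total, distinct, f"{distinct / total:.1%}")
print(unique, f"{unique / total:.1%}")
print(min(lengths), median_length, mean_length, max(lengths))
```

The mean length works out to 167 / 80 = 2.0875: 73 observations have two characters and 7 have three.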

Length

Histogram of lengths of the category

Common Values (Plot)

Interactions


Correlations

 | 연번 | 구분 | 간행물명 | 발행사 | 간기
연번 | 1.000 | 0.970 | 0.939 | 0.903 | 0.602
구분 | 0.970 | 1.000 | 0.000 | 0.928 | 0.848
간행물명 | 0.939 | 0.000 | 1.000 | 1.000 | 1.000
발행사 | 0.903 | 0.928 | 1.000 | 1.000 | 0.809
간기 | 0.602 | 0.848 | 1.000 | 0.809 | 1.000

 | 간기 | 구분
간기 | 1.000 | 0.508
구분 | 0.508 | 1.000

 | 연번 | 구분 | 간기
연번 | 1.000 | 0.875 | 0.334
구분 | 0.875 | 1.000 | 0.508
간기 | 0.334 | 0.508 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

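First rows

(index) | 연번 | 구분 | 간행물명 | 발행사 | 간기
0 | 1 | 종이잡지 | BBC 사이언스 | BBC사이언스 편집부 | 월간
1 | 2 | 종이잡지 | 건강다이제스트 | 건강다이제스트사 | 월간
2 | 3 | 종이잡지 | 골프다이제스트 | 골프다이제스트 | 월간
3 | 4 | 종이잡지 | 과학동아 | 동아사이언스 | 월간
4 | 5 | 종이잡지 | 더그아웃 | 대단한미디어 | 월간
5 | 6 | 종이잡지 | 매경이코노미 | 매일경제신문사 | 주간
6 | 7 | 종이잡지 | 베스트일레븐 | ㈜베스트일레븐 | 월간
7 | 8 | 종이잡지 | 빅이슈코리아 | 빅이슈코리아 | 격주간
8 | 9 | 종이잡지 | 수학동아 | 동아사이언스 | 월간
9 | 10 | 종이잡지 | 시사IN | 참언론 | 주간

Last rows

(index) | 연번 | 구분 | 간행물명 | 발행사 | 간기
70 | 71 | 전자신문 | 한국일보 | 한국일보사 | 일간
71 | 72 | 전자신문 | 머니투데이 | ㈜머니투데이 | 일간
72 | 73 | 전자신문 | 서울경제 | 서울경제신문사 | 일간
73 | 74 | 전자신문 | 코리아헤럴드 | 헤럴드 | 일간
74 | 75 | 전자신문 | 경남도민일보 | 경남도민일보사 | 일간
75 | 76 | 전자신문 | 일간스포츠 | 일간스포츠 | 일간
76 | 77 | 전자신문 | 파이낸셜뉴스 | 파이낸셜뉴스신문 | 일간
77 | 78 | 전자신문 | 헤럴드경제 | 헤럴드 | 일간
78 | 79 | 전자신문 | 경상일보 | 경상일보사 | 일간
79 | 80 | 전자신문 | 부산일보 | 부산일보사 | 일간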