Overview

Dataset statistics

Number of variables: 5
Number of observations: 34
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.6 KiB
Average record size in memory: 46.9 B

Variable types

Numeric: 2
Categorical: 1
Text: 1
Boolean: 1

Dataset

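Description: Provides the post-prefix (말머리, the subject tag attached to a post title) information for posts written by university librarians nationwide on the RISS Librarian Community boards operated by the Korea Education and Research Information Service (KERIS).
Author: Korea Education and Research Information Service (KERIS, 한국교육학술정보원)
URL: https://www.data.go.kr/data/15071955/fileData.do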

Alerts

FLAG has constant value "" (Constant)
SUBJECT ID is highly overall correlated with 게시판ID (High correlation)
게시판ID is highly overall correlated with SUBJECT ID (High correlation)
SUBJECT ID has unique values (Unique)
게시판말머리명 has unique values (Unique)
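
The Constant and Unique alerts can be checked by hand. The sketch below is not ydata-profiling's implementation; it applies the plain definitions (one distinct value means constant, as many distinct values as rows means unique) to the first ten rows listed in the Sample section of this report. The helper name `column_alerts` is hypothetical.

```python
# First ten rows from the Sample section of this report:
# (SUBJECT ID, 게시판ID, 게시판말머리명, 출력순서, FLAG)
ROWS = [
    (11, 2, "사서이야기", 4, "Y"),
    (12, 2, "도서관이야기", 3, "Y"),
    (13, 2, "건의&제안", 9, "Y"),
    (14, 2, "가입인사", 1, "Y"),
    (6, 2, "질문있습니다", 8, "Y"),
    (7, 2, "감동적임", 7, "Y"),
    (8, 2, "웃겨요", 6, "Y"),
    (9, 2, "소소한이야기", 5, "Y"),
    (10, 2, "도서관관련", 2, "Y"),
    (36, 49, "2016", 6, "Y"),
]
COLUMNS = ["SUBJECT ID", "게시판ID", "게시판말머리명", "출력순서", "FLAG"]


def column_alerts(rows, columns):
    """Return {column: alert} using the plain Constant/Unique definitions."""
    alerts = {}
    for position, name in enumerate(columns):
        values = [row[position] for row in rows]
        distinct = len(set(values))
        if distinct == 1:
            alerts[name] = "Constant"
        elif distinct == len(values):
            alerts[name] = "Unique"
    return alerts


alerts = column_alerts(ROWS, COLUMNS)
for name, alert in alerts.items():
    print(f"{name}: {alert}")
```

On these ten rows the result agrees with the alerts listed for the full 34-row dataset. 게시판ID and 출력순서 receive neither label because each repeats a value without being constant.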

Reproduction

Analysis started: 2023-12-12 12:45:19.820987
Analysis finished: 2023-12-12 12:45:20.520126
Duration: 0.7 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

SUBJECT ID
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 34
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 29.852941
Minimum: 6
Maximum: 49
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 438.0 B

Quantile statistics

Minimum: 6
5-th percentile: 7.65
Q1: 16.75
Median: 32.5
Q3: 40.75
95-th percentile: 47.35
Maximum: 49
Range: 43
Interquartile range (IQR): 24

Descriptive statistics

Standard deviation: 13.689484
Coefficient of variation (CV): 0.45856398
Kurtosis: -1.0929905
Mean: 29.852941
Median Absolute Deviation (MAD): 9
Skewness: -0.46431735
Sum: 1015
Variance: 187.40196
Monotonicity: Not monotonic
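
The quantile and descriptive statistics for SUBJECT ID can be re-derived with the standard library. The sketch assumes the column holds the 34 integers 6 to 14 and 25 to 49. That set is inferred from the value tables and the reported sum of 1015; the report never lists it in full. The `percentile` helper is hypothetical and uses linear interpolation between neighbouring order statistics.

```python
import statistics

# Assumption: SUBJECT ID takes the 34 integer values 6..14 and 25..49.
values = sorted(list(range(6, 15)) + list(range(25, 50)))


def percentile(sorted_values, q):
    """Linear-interpolation percentile (q in [0, 1]) on pre-sorted data."""
    position = q * (len(sorted_values) - 1)
    lower = int(position)
    upper = min(lower + 1, len(sorted_values) - 1)
    fraction = position - lower
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])


mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance (n - 1 denominator)
std_dev = statistics.stdev(values)
cv = std_dev / mean
q1, median, q3 = (percentile(values, q) for q in (0.25, 0.50, 0.75))
# Median absolute deviation around the median.
mad = statistics.median(abs(v - median) for v in values)

print(f"n={len(values)} sum={sum(values)} mean={mean:.6f}")
print(f"variance={variance:.5f} std={std_dev:.6f} cv={cv:.8f}")
print(f"Q1={q1} median={median} Q3={q3} IQR={q3 - q1} MAD={mad}")
```

Under that assumption the figures agree with the reported values, which suggests the report uses the sample (n - 1) variance and linearly interpolated quantiles.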
Histogram with fixed size bins (bins=34)
Common values

Value  Count  Frequency (%)
11  1  2.9%
32  1  2.9%
26  1  2.9%
27  1  2.9%
28  1  2.9%
29  1  2.9%
30  1  2.9%
31  1  2.9%
33  1  2.9%
45  1  2.9%
Other values (24)  24  70.6%

Smallest 10 values

Value  Count  Frequency (%)
6  1  2.9%
7  1  2.9%
8  1  2.9%
9  1  2.9%
10  1  2.9%
11  1  2.9%
12  1  2.9%
13  1  2.9%
14  1  2.9%
25  1  2.9%

Largest 10 values

Value  Count  Frequency (%)
49  1  2.9%
48  1  2.9%
47  1  2.9%
46  1  2.9%
45  1  2.9%
44  1  2.9%
43  1  2.9%
42  1  2.9%
41  1  2.9%
40  1  2.9%

게시판ID (board ID)
Categorical

HIGH CORRELATION 

Distinct: 4
Distinct (%): 11.8%
Missing: 0
Missing (%): 0.0%
Memory size: 404.0 B

Length

Max length: 2
Median length: 2
Mean length: 1.7352941
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2
2nd row: 2
3rd row: 2
4th row: 2
5th row: 2

Common Values

Value  Count  Frequency (%)
48  12  35.3%
2  9  26.5%
49  8  23.5%
87  5  14.7%
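
The frequency percentages, the distinct share, and the mean label length for 게시판ID all follow from the four counts in the table. A minimal sketch, assuming the values are stored as the strings shown in the report:

```python
from collections import Counter

# Counts taken from the Common Values table of 게시판ID.
counts = Counter({"48": 12, "2": 9, "49": 8, "87": 5})
n_rows = sum(counts.values())

# Share of rows per value, in percent.
frequency = {value: 100 * count / n_rows for value, count in counts.items()}
# Distinct values as a share of all rows.
distinct_pct = 100 * len(counts) / n_rows
# Mean string length, weighting each value's length by its count.
mean_length = sum(len(value) * count for value, count in counts.items()) / n_rows

for value, count in counts.most_common():
    print(f"{value:>3} {count:>3} {frequency[value]:5.1f}%")
print(f"distinct: {len(counts)} ({distinct_pct:.1f}%), mean length: {mean_length:.7f}")
```

The mean length of 1.7352941 is 59/34: nine single-character labels ("2") and twenty-five two-character labels.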

Length

Histogram of lengths of the category

게시판말머리명 (board post-prefix name)
Text

UNIQUE

Distinct: 34
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 404.0 B

Length

Max length: 10
Median length: 9
Mean length: 4.7352941
Min length: 2

Characters and Unicode

Total characters: 161
Distinct characters: 82
Distinct categories: 5
Distinct scripts: 3
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
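
As an illustration of those character properties, the sketch below tallies Unicode general categories for the five labels shown as sample rows for this variable, using Python's `unicodedata` module. The script guess based on the character name is an assumption of this sketch; the standard library has no script property lookup, and this is not necessarily how the profiling tool derives scripts.

```python
import unicodedata
from collections import Counter

# The five sample labels shown for this variable in the report.
labels = ["사서이야기", "도서관이야기", "건의&제안", "가입인사", "질문있습니다"]

category_counts = Counter()
script_counts = Counter()
for label in labels:
    for char in label:
        # General category, e.g. "Lo" (Other Letter), "Po" (Other Punctuation).
        category_counts[unicodedata.category(char)] += 1
        # Crude script guess from the character name (assumption of this sketch).
        name = unicodedata.name(char, "")
        script_counts["Hangul" if name.startswith("HANGUL") else "Other"] += 1

print(dict(category_counts))
print(dict(script_counts))
```

Hangul syllables fall in the Other Letter category (`Lo`) and the ampersand in Other Punctuation (`Po`), the same category names the report uses.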

Unique

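Unique: 34
Unique (%): 100.0%

Sample

1st row: 사서이야기 (librarian stories)
2nd row: 도서관이야기 (library stories)
3rd row: 건의&제안 (suggestions & proposals)
4th row: 가입인사 (new member greetings)
5th row: 질문있습니다 (I have a question)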
Value  Count  Frequency (%)
(value not preserved)  2  5.0%
통계/평가 (statistics/evaluation)  2  5.0%
사서이야기 (librarian stories)  1  2.5%
전산 (computing)  1  2.5%
협상보고서 (negotiation reports)  1  2.5%
품목별참고자료 (reference materials by item)  1  2.5%
2017  1  2.5%
상호대차 (interlibrary loan)  1  2.5%
수서 (acquisitions)  1  2.5%
도서관이야기 (library stories)  1  2.5%
Other values (28)  28  70.0%

Most occurring characters

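Value  Count  Frequency (%)
2  9  5.6%
1  9  5.6%
0  8  5.0%
(space)  6  3.7%
(Hangul character, not preserved)  5  3.1%
(Hangul character, not preserved)  5  3.1%
(Hangul character, not preserved)  4  2.5%
(Hangul character, not preserved)  4  2.5%
(Hangul character, not preserved)  4  2.5%
(Hangul character, not preserved)  4  2.5%
Other values (72)  103  64.0%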

Most occurring categories

Value  Count  Frequency (%)
Other Letter  117  72.7%
Decimal Number  32  19.9%
Space Separator  6  3.7%
Other Punctuation  4  2.5%
Uppercase Letter  2  1.2%

Most frequent character per category

Other Letter
Value  Count  Frequency (%)
(not preserved)  5  4.3%
(not preserved)  5  4.3%
(not preserved)  4  3.4%
(not preserved)  4  3.4%
(not preserved)  4  3.4%
(not preserved)  4  3.4%
(not preserved)  4  3.4%
(not preserved)  3  2.6%
(not preserved)  3  2.6%
(not preserved)  3  2.6%
Other values (58)  78  66.7%

Decimal Number
Value  Count  Frequency (%)
2  9  28.1%
1  9  28.1%
0  8  25.0%
7  1  3.1%
5  1  3.1%
4  1  3.1%
3  1  3.1%
8  1  3.1%
6  1  3.1%

Other Punctuation
Value  Count  Frequency (%)
/  2  50.0%
&  2  50.0%

Uppercase Letter
Value  Count  Frequency (%)
Q  1  50.0%
A  1  50.0%

Space Separator
Value  Count  Frequency (%)
(space)  6  100.0%

Most occurring scripts

Value  Count  Frequency (%)
Hangul  117  72.7%
Common  42  26.1%
Latin  2  1.2%

Most frequent character per script

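Hangul
Value  Count  Frequency (%)
(not preserved)  5  4.3%
(not preserved)  5  4.3%
(not preserved)  4  3.4%
(not preserved)  4  3.4%
(not preserved)  4  3.4%
(not preserved)  4  3.4%
(not preserved)  4  3.4%
(not preserved)  3  2.6%
(not preserved)  3  2.6%
(not preserved)  3  2.6%
Other values (58)  78  66.7%

Common
Value  Count  Frequency (%)
2  9  21.4%
1  9  21.4%
0  8  19.0%
(space)  6  14.3%
/  2  4.8%
&  2  4.8%
7  1  2.4%
5  1  2.4%
4  1  2.4%
3  1  2.4%
Other values (2)  2  4.8%

Latin
Value  Count  Frequency (%)
Q  1  50.0%
A  1  50.0%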

Most occurring blocks

Value  Count  Frequency (%)
Hangul  117  72.7%
ASCII  44  27.3%

Most frequent character per block

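ASCII
Value  Count  Frequency (%)
2  9  20.5%
1  9  20.5%
0  8  18.2%
(space)  6  13.6%
/  2  4.5%
&  2  4.5%
7  1  2.3%
Q  1  2.3%
A  1  2.3%
5  1  2.3%
Other values (4)  4  9.1%

Hangul
Value  Count  Frequency (%)
(not preserved)  5  4.3%
(not preserved)  5  4.3%
(not preserved)  4  3.4%
(not preserved)  4  3.4%
(not preserved)  4  3.4%
(not preserved)  4  3.4%
(not preserved)  4  3.4%
(not preserved)  3  2.6%
(not preserved)  3  2.6%
(not preserved)  3  2.6%
Other values (58)  78  66.7%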

출력순서 (display order)
Real number (ℝ)

Distinct: 12
Distinct (%): 35.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 5.1176471
Minimum: 1
Maximum: 12
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 438.0 B

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 3
Median: 5
Q3: 7
95-th percentile: 10.35
Maximum: 12
Range: 11
Interquartile range (IQR): 4

Descriptive statistics

Standard deviation: 3.0327975
Coefficient of variation (CV): 0.59261561
Kurtosis: -0.57581109
Mean: 5.1176471
Median Absolute Deviation (MAD): 2
Skewness: 0.47647695
Sum: 174
Variance: 9.197861
Monotonicity: Not monotonic
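
Because 출력순서 has only 12 distinct values, its mean, variance, and coefficient of variation can be computed straight from the value counts without expanding them into 34 rows. The sketch below takes the counts from the value tables of this variable and assumes the sample (n - 1) variance.

```python
import math

# value -> count, taken from the value tables of 출력순서 in this report.
freq = {1: 4, 2: 4, 3: 4, 4: 4, 5: 4, 6: 3, 7: 3, 8: 3, 9: 2, 10: 1, 11: 1, 12: 1}

n = sum(freq.values())
total = sum(value * count for value, count in freq.items())
mean = total / n
# Sample variance (n - 1 denominator) computed from the grouped counts.
variance = sum(count * (value - mean) ** 2 for value, count in freq.items()) / (n - 1)
std_dev = math.sqrt(variance)
cv = std_dev / mean

print(f"n={n} sum={total} mean={mean:.7f}")
print(f"variance={variance:.6f} std={std_dev:.7f} cv={cv:.8f}")
```

The grouped form weights each squared deviation by its count, which is equivalent to summing over the individual rows.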
Histogram with fixed size bins (bins=12)
Common values

Value  Count  Frequency (%)
4  4  11.8%
3  4  11.8%
1  4  11.8%
5  4  11.8%
2  4  11.8%
8  3  8.8%
7  3  8.8%
6  3  8.8%
9  2  5.9%
11  1  2.9%
Other values (2)  2  5.9%

Smallest 10 values

Value  Count  Frequency (%)
1  4  11.8%
2  4  11.8%
3  4  11.8%
4  4  11.8%
5  4  11.8%
6  3  8.8%
7  3  8.8%
8  3  8.8%
9  2  5.9%
10  1  2.9%

Largest 10 values

Value  Count  Frequency (%)
12  1  2.9%
11  1  2.9%
10  1  2.9%
9  2  5.9%
8  3  8.8%
7  3  8.8%
6  3  8.8%
5  4  11.8%
4  4  11.8%
3  4  11.8%

FLAG
Boolean

CONSTANT 

Distinct: 1
Distinct (%): 2.9%
Missing: 0
Missing (%): 0.0%
Memory size: 166.0 B

Value  Count  Frequency (%)
True  34  100.0%

Interactions

[Four interaction plots; images not reproduced.]

Correlations

[Correlation heatmap; image not reproduced.]

              SUBJECT ID  게시판ID  게시판말머리명  출력순서
SUBJECT ID    1.000  0.994  1.000  0.000
게시판ID       0.994  1.000  1.000  0.000
게시판말머리명  1.000  1.000  1.000  1.000
출력순서       0.000  0.000  1.000  1.000

[Correlation heatmap; image not reproduced.]

              SUBJECT ID  출력순서  게시판ID
SUBJECT ID    1.000  -0.236  0.836
출력순서       -0.236  1.000  0.000
게시판ID       0.836  0.000  1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

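First rows

Row  SUBJECT ID  게시판ID  게시판말머리명  출력순서  FLAG
0  11  2  사서이야기 (librarian stories)  4  Y
1  12  2  도서관이야기 (library stories)  3  Y
2  13  2  건의&제안 (suggestions & proposals)  9  Y
3  14  2  가입인사 (new member greetings)  1  Y
4  6  2  질문있습니다 (I have a question)  8  Y
5  7  2  감동적임 (touching)  7  Y
6  8  2  웃겨요 (funny)  6  Y
7  9  2  소소한이야기 (everyday stories)  5  Y
8  10  2  도서관관련 (library-related)  2  Y
9  36  49  2016  6  Y

Last rows

Row  SUBJECT ID  게시판ID  게시판말머리명  출력순서  FLAG
24  30  48  참고봉사 및 열람 (reference service and reading)  5  Y
25  31  48  정리 (cataloging)  4  Y
26  32  48  전산 (computing)  3  Y
27  33  48  수서 (acquisitions)  2  Y
28  34  48  상호대차 (interlibrary loan)  1  Y
29  35  49  2017  7  Y
30  46  87  품목별참고자료 (reference materials by item)  4  Y
31  47  87  협상보고서 (negotiation reports)  3  Y
32  48  87  회의자료 (meeting materials)  2  Y
33  49  87  협상회의록 (negotiation meeting minutes)  1  Y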