Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 34 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 1.6 KiB |
Average record size in memory | 46.9 B |
Variable types
Numeric | 2 |
---|---|
Categorical | 1 |
Text | 1 |
Boolean | 1 |
Dataset
Description | 한국교육학술정보원에서 운영하는 RISS 사서커뮤니티 게시판의 전국대학 사서 이용자가 작성한 글의 말머리 정보를 제공합니다. |
---|---|
Author | 한국교육학술정보원 |
URL | https://www.data.go.kr/data/15071955/fileData.do |
FLAG has constant value "" | Constant |
SUBJECT ID is highly overall correlated with 게시판ID | High correlation |
게시판ID is highly overall correlated with SUBJECT ID | High correlation |
SUBJECT ID has unique values | Unique |
게시판말머리명 has unique values | Unique |
Reproduction
Analysis started | 2023-12-12 12:45:19.820987 |
---|---|
Analysis finished | 2023-12-12 12:45:20.520126 |
Duration | 0.7 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
SUBJECT ID
Real number (ℝ)
HIGH CORRELATION
  UNIQUE
 
Distinct | 34 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 29.852941 |
Minimum | 6 |
---|---|
Maximum | 49 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 438.0 B |
Quantile statistics
Minimum | 6 |
---|---|
5-th percentile | 7.65 |
Q1 | 16.75 |
median | 32.5 |
Q3 | 40.75 |
95-th percentile | 47.35 |
Maximum | 49 |
Range | 43 |
Interquartile range (IQR) | 24 |
Descriptive statistics
Standard deviation | 13.689484 |
---|---|
Coefficient of variation (CV) | 0.45856398 |
Kurtosis | -1.0929905 |
Mean | 29.852941 |
Median Absolute Deviation (MAD) | 9 |
Skewness | -0.46431735 |
Sum | 1015 |
Variance | 187.40196 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
11 | 1 | 2.9% |
32 | 1 | 2.9% |
26 | 1 | 2.9% |
27 | 1 | 2.9% |
28 | 1 | 2.9% |
29 | 1 | 2.9% |
30 | 1 | 2.9% |
31 | 1 | 2.9% |
33 | 1 | 2.9% |
45 | 1 | 2.9% |
Other values (24) | 24 |
Value | Count | Frequency (%) |
6 | 1 | |
7 | 1 | |
8 | 1 | |
9 | 1 | |
10 | 1 | |
11 | 1 | |
12 | 1 | |
13 | 1 | |
14 | 1 | |
25 | 1 |
Value | Count | Frequency (%) |
49 | 1 | |
48 | 1 | |
47 | 1 | |
46 | 1 | |
45 | 1 | |
44 | 1 | |
43 | 1 | |
42 | 1 | |
41 | 1 | |
40 | 1 |
게시판ID
Categorical
HIGH CORRELATION
 
Distinct | 4 |
---|---|
Distinct (%) | 11.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 404.0 B |
48 | |
---|---|
2 | |
49 | |
87 |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 1.7352941 |
Min length | 1 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2 |
---|---|
2nd row | 2 |
3rd row | 2 |
4th row | 2 |
5th row | 2 |
Common Values
Value | Count | Frequency (%) |
48 | 12 | |
2 | 9 | |
49 | 8 | |
87 | 5 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
48 | 12 | |
2 | 9 | |
49 | 8 | |
87 | 5 |
게시판말머리명
Text
UNIQUE
 
Distinct | 34 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 404.0 B |
Value | Count | Frequency (%) |
및 | 2 | 5.0% |
통계/평가 | 2 | 5.0% |
사서이야기 | 1 | 2.5% |
전산 | 1 | 2.5% |
협상보고서 | 1 | 2.5% |
품목별참고자료 | 1 | 2.5% |
2017 | 1 | 2.5% |
상호대차 | 1 | 2.5% |
수서 | 1 | 2.5% |
도서관이야기 | 1 | 2.5% |
Other values (28) | 28 |
Most occurring characters
Value | Count | Frequency (%) |
2 | 9 | 5.6% |
1 | 9 | 5.6% |
0 | 8 | 5.0% |
6 | 3.7% | |
서 | 5 | 3.1% |
기 | 5 | 3.1% |
사 | 4 | 2.5% |
자 | 4 | 2.5% |
고 | 4 | 2.5% |
가 | 4 | 2.5% |
Other values (72) | 103 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 117 | |
Decimal Number | 32 | 19.9% |
Space Separator | 6 | 3.7% |
Other Punctuation | 4 | 2.5% |
Uppercase Letter | 2 | 1.2% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
서 | 5 | 4.3% |
기 | 5 | 4.3% |
사 | 4 | 3.4% |
자 | 4 | 3.4% |
고 | 4 | 3.4% |
가 | 4 | 3.4% |
이 | 4 | 3.4% |
상 | 3 | 2.6% |
료 | 3 | 2.6% |
평 | 3 | 2.6% |
Other values (58) | 78 |
Decimal Number
Value | Count | Frequency (%) |
2 | 9 | |
1 | 9 | |
0 | 8 | |
7 | 1 | 3.1% |
5 | 1 | 3.1% |
4 | 1 | 3.1% |
3 | 1 | 3.1% |
8 | 1 | 3.1% |
6 | 1 | 3.1% |
Other Punctuation
Value | Count | Frequency (%) |
/ | 2 | |
& | 2 |
Uppercase Letter
Value | Count | Frequency (%) |
Q | 1 | |
A | 1 |
Space Separator
Value | Count | Frequency (%) |
6 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 117 | |
Common | 42 | 26.1% |
Latin | 2 | 1.2% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
서 | 5 | 4.3% |
기 | 5 | 4.3% |
사 | 4 | 3.4% |
자 | 4 | 3.4% |
고 | 4 | 3.4% |
가 | 4 | 3.4% |
이 | 4 | 3.4% |
상 | 3 | 2.6% |
료 | 3 | 2.6% |
평 | 3 | 2.6% |
Other values (58) | 78 |
Common
Value | Count | Frequency (%) |
2 | 9 | |
1 | 9 | |
0 | 8 | |
6 | ||
/ | 2 | 4.8% |
& | 2 | 4.8% |
7 | 1 | 2.4% |
5 | 1 | 2.4% |
4 | 1 | 2.4% |
3 | 1 | 2.4% |
Other values (2) | 2 | 4.8% |
Latin
Value | Count | Frequency (%) |
Q | 1 | |
A | 1 |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 117 | |
ASCII | 44 | 27.3% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
2 | 9 | |
1 | 9 | |
0 | 8 | |
6 | ||
/ | 2 | 4.5% |
& | 2 | 4.5% |
7 | 1 | 2.3% |
Q | 1 | 2.3% |
A | 1 | 2.3% |
5 | 1 | 2.3% |
Other values (4) | 4 |
Hangul
Value | Count | Frequency (%) |
서 | 5 | 4.3% |
기 | 5 | 4.3% |
사 | 4 | 3.4% |
자 | 4 | 3.4% |
고 | 4 | 3.4% |
가 | 4 | 3.4% |
이 | 4 | 3.4% |
상 | 3 | 2.6% |
료 | 3 | 2.6% |
평 | 3 | 2.6% |
Other values (58) | 78 |
출력순서
Real number (ℝ)
Distinct | 12 |
---|---|
Distinct (%) | 35.3% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 5.1176471 |
Minimum | 1 |
---|---|
Maximum | 12 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 438.0 B |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 3 |
median | 5 |
Q3 | 7 |
95-th percentile | 10.35 |
Maximum | 12 |
Range | 11 |
Interquartile range (IQR) | 4 |
Descriptive statistics
Standard deviation | 3.0327975 |
---|---|
Coefficient of variation (CV) | 0.59261561 |
Kurtosis | -0.57581109 |
Mean | 5.1176471 |
Median Absolute Deviation (MAD) | 2 |
Skewness | 0.47647695 |
Sum | 174 |
Variance | 9.197861 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
4 | 4 | |
3 | 4 | |
1 | 4 | |
5 | 4 | |
2 | 4 | |
8 | 3 | |
7 | 3 | |
6 | 3 | |
9 | 2 | |
11 | 1 | 2.9% |
Other values (2) | 2 |
Value | Count | Frequency (%) |
1 | 4 | |
2 | 4 | |
3 | 4 | |
4 | 4 | |
5 | 4 | |
6 | 3 | |
7 | 3 | |
8 | 3 | |
9 | 2 | |
10 | 1 | 2.9% |
Value | Count | Frequency (%) |
12 | 1 | 2.9% |
11 | 1 | 2.9% |
10 | 1 | 2.9% |
9 | 2 | |
8 | 3 | |
7 | 3 | |
6 | 3 | |
5 | 4 | |
4 | 4 | |
3 | 4 |
FLAG
Boolean
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 2.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 166.0 B |
True |
---|
Value | Count | Frequency (%) |
True | 34 |
SUBJECT ID | 게시판ID | 게시판말머리명 | 출력순서 | |
---|---|---|---|---|
SUBJECT ID | 1.000 | 0.994 | 1.000 | 0.000 |
게시판ID | 0.994 | 1.000 | 1.000 | 0.000 |
게시판말머리명 | 1.000 | 1.000 | 1.000 | 1.000 |
출력순서 | 0.000 | 0.000 | 1.000 | 1.000 |
SUBJECT ID | 출력순서 | 게시판ID | |
---|---|---|---|
SUBJECT ID | 1.000 | -0.236 | 0.836 |
출력순서 | -0.236 | 1.000 | 0.000 |
게시판ID | 0.836 | 0.000 | 1.000 |
SUBJECT ID | 게시판ID | 게시판말머리명 | 출력순서 | FLAG | |
---|---|---|---|---|---|
0 | 11 | 2 | 사서이야기 | 4 | Y |
1 | 12 | 2 | 도서관이야기 | 3 | Y |
2 | 13 | 2 | 건의&제안 | 9 | Y |
3 | 14 | 2 | 가입인사 | 1 | Y |
4 | 6 | 2 | 질문있습니다 | 8 | Y |
5 | 7 | 2 | 감동적임 | 7 | Y |
6 | 8 | 2 | 웃겨요 | 6 | Y |
7 | 9 | 2 | 소소한이야기 | 5 | Y |
8 | 10 | 2 | 도서관관련 | 2 | Y |
9 | 36 | 49 | 2016 | 6 | Y |
SUBJECT ID | 게시판ID | 게시판말머리명 | 출력순서 | FLAG | |
---|---|---|---|---|---|
24 | 30 | 48 | 참고봉사 및 열람 | 5 | Y |
25 | 31 | 48 | 정리 | 4 | Y |
26 | 32 | 48 | 전산 | 3 | Y |
27 | 33 | 48 | 수서 | 2 | Y |
28 | 34 | 48 | 상호대차 | 1 | Y |
29 | 35 | 49 | 2017 | 7 | Y |
30 | 46 | 87 | 품목별참고자료 | 4 | Y |
31 | 47 | 87 | 협상보고서 | 3 | Y |
32 | 48 | 87 | 회의자료 | 2 | Y |
33 | 49 | 87 | 협상회의록 | 1 | Y |