Overview

Dataset statistics

Number of variables: 7
Number of observations: 192
Missing cells: 192
Missing cells (%): 14.3%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 11.6 KiB
Average record size in memory: 61.7 B
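
The derived figures above follow from the raw counts. The sketch below rechecks them; the variable names are illustrative, and because 11.6 KiB is itself a rounded value, the per-record size it yields only approximates the reported 61.7 B.

```python
# Recompute the derived overview figures from the reported counts.
n_variables = 7
n_observations = 192
missing_cells = 192

total_cells = n_variables * n_observations        # 1344 cells in the table
missing_pct = 100 * missing_cells / total_cells   # share of empty cells

# 11.6 KiB is already rounded, so this only approximates the reported 61.7 B.
approx_record_bytes = 11.6 * 1024 / n_observations

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Approx. record size: {approx_record_bytes:.1f} B")
```

The 192 missing cells equal the number of observations, which is consistent with exactly one column (파일 BLOG, per the alerts) being empty in every row.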

Variable types

Numeric: 3
Text: 2
Categorical: 1
Unsupported: 1

Dataset

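Description: Files attached to posts in the RISS Librarian Community, operated by the Korea Education and Research Information Service (KERIS). The dataset provides a variety of files, such as the minutes of meetings and councils.
Author: Korea Education and Research Information Service (한국교육학술정보원)
URL: https://www.data.go.kr/data/15071958/fileData.do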

Alerts

다운로드횟수 has constant value "0" (Constant)
파일 BLOG has 192 (100.0%) missing values (Missing)
파일명 has unique values (Unique)
파일 BLOG is an unsupported type; check whether it needs cleaning or further analysis (Unsupported)
파일순서 has 136 (70.8%) zeros (Zeros)

Reproduction

Analysis started: 2023-12-12 19:59:34.196780
Analysis finished: 2023-12-12 19:59:35.749585
Duration: 1.55 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
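
A report of this kind can be regenerated with ydata-profiling. The following is a minimal sketch, not the exact procedure used here: the helper name `build_profile_report`, the report title, the output file name, and the encoding default are all assumptions, and the original run used the downloadable config.json, which is not reproduced.

```python
def build_profile_report(csv_path, output_path="report.html", encoding="utf-8"):
    """Profile a CSV file and write an HTML report (hypothetical helper)."""
    # Imported lazily so this module loads even where the packages are absent.
    import pandas as pd
    from ydata_profiling import ProfileReport

    # Assumption: the data is a CSV export; the encoding may need adjusting.
    df = pd.read_csv(csv_path, encoding=encoding)
    report = ProfileReport(df, title="RISS librarian community attachments")
    report.to_file(output_path)
    return output_path
```

A call such as `build_profile_report("attachments.csv")` (a hypothetical file name) would write `report.html`. Files from Korean public data portals are often encoded as CP949, so `encoding="cp949"` may be needed; results can also differ slightly across library versions.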

Variables

게시글ID
Real number (ℝ)

Distinct: 138
Distinct (%): 71.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 17590.474
Minimum: 17021
Maximum: 17671
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.8 KiB

Quantile statistics

Minimum: 17021
5th percentile: 17473.1
Q1: 17556
Median: 17606.5
Q3: 17647.25
95th percentile: 17666.45
Maximum: 17671
Range: 650
Interquartile range (IQR): 91.25
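
The last two entries are simple differences of the quantiles listed above. A short check, with illustrative variable names:

```python
# Quantile figures reported for 게시글ID.
minimum, q1, q3, maximum = 17021, 17556, 17647.25, 17671

value_range = maximum - minimum   # spread of the whole column
iqr = q3 - q1                     # spread of the middle 50% of rows

print(value_range, iqr)  # prints: 650 91.25
```

The range is roughly seven times the IQR, which points to an isolated low value rather than a broadly spread column.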

Descriptive statistics

Standard deviation: 72.849262
Coefficient of variation (CV): 0.0041414042
Kurtosis: 18.268571
Mean: 17590.474
Median Absolute Deviation (MAD): 47.5
Skewness: -2.8320719
Sum: 3377371
Variance: 5307.015
Monotonicity: Not monotonic
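
Several of these statistics are tied together by simple identities: the mean is the sum divided by the 192 observations, the variance is the squared standard deviation, and the CV is the standard deviation divided by the mean. The sketch below confirms that the reported values agree with each other up to rounding; the variable names are illustrative.

```python
import math

# Figures reported for 게시글ID.
n = 192
total = 3377371
std_dev = 72.849262

mean = total / n           # arithmetic mean
variance = std_dev ** 2    # variance is the squared standard deviation
cv = std_dev / mean        # coefficient of variation (unitless)

# Compare against the reported values, allowing for rounding in the report.
print(math.isclose(mean, 17590.474, rel_tol=1e-6))
print(math.isclose(variance, 5307.015, rel_tol=1e-6))
print(math.isclose(cv, 0.0041414042, rel_tol=1e-6))
```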
Histogram with fixed size bins (bins=50)
Common Values
Value | Count | Frequency (%)
17556 | 7 | 3.6%
17657 | 7 | 3.6%
17658 | 6 | 3.1%
17633 | 4 | 2.1%
17557 | 4 | 2.1%
17453 | 4 | 2.1%
17656 | 4 | 2.1%
17619 | 3 | 1.6%
17653 | 3 | 1.6%
17627 | 3 | 1.6%
Other values (128) | 147 | 76.6%

Minimum 10 values
Value | Count | Frequency (%)
17021 | 1 | 0.5%
17451 | 1 | 0.5%
17452 | 1 | 0.5%
17453 | 4 | 2.1%
17468 | 1 | 0.5%
17470 | 1 | 0.5%
17472 | 1 | 0.5%
17474 | 1 | 0.5%
17476 | 1 | 0.5%
17478 | 1 | 0.5%

Maximum 10 values
Value | Count | Frequency (%)
17671 | 1 | 0.5%
17670 | 3 | 1.6%
17669 | 3 | 1.6%
17668 | 2 | 1.0%
17667 | 1 | 0.5%
17666 | 2 | 1.0%
17665 | 1 | 0.5%
17664 | 1 | 0.5%
17663 | 1 | 0.5%
17662 | 1 | 0.5%

파일순서
Real number (ℝ)

ZEROS 

Distinct: 7
Distinct (%): 3.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.63541667
Minimum: 0
Maximum: 6
Zeros: 136
Zeros (%): 70.8%
Negative: 0
Negative (%): 0.0%
Memory size: 1.8 KiB

Quantile statistics

Minimum: 0
5th percentile: 0
Q1: 0
Median: 0
Q3: 1
95th percentile: 3.45
Maximum: 6
Range: 6
Interquartile range (IQR): 1

Descriptive statistics

Standard deviation: 1.2542685
Coefficient of variation (CV): 1.9739307
Kurtosis: 5.0903733
Mean: 0.63541667
Median Absolute Deviation (MAD): 0
Skewness: 2.311142
Sum: 122
Variance: 1.5731894
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=7)
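Common Values
Value | Count | Frequency (%)
0 | 136 | 70.8%
1 | 26 | 13.5%
2 | 11 | 5.7%
3 | 9 | 4.7%
4 | 5 | 2.6%
5 | 3 | 1.6%
6 | 2 | 1.0%

Minimum 10 values
Value | Count | Frequency (%)
0 | 136 | 70.8%
1 | 26 | 13.5%
2 | 11 | 5.7%
3 | 9 | 4.7%
4 | 5 | 2.6%
5 | 3 | 1.6%
6 | 2 | 1.0%

Maximum 10 values
Value | Count | Frequency (%)
6 | 2 | 1.0%
5 | 3 | 1.6%
4 | 5 | 2.6%
3 | 9 | 4.7%
2 | 11 | 5.7%
1 | 26 | 13.5%
0 | 136 | 70.8%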
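
Because 파일순서 takes only seven distinct values, its summary statistics can be rebuilt from the value counts alone. The sketch below does so under two assumptions that happen to match the reported figures: the variance is the sample variance (dividing by n - 1), and the 95th percentile uses linear interpolation between neighbouring ranks. Neither is stated in the report itself.

```python
import statistics

# Value counts reported for 파일순서: value -> number of rows.
counts = {0: 136, 1: 26, 2: 11, 3: 9, 4: 5, 5: 3, 6: 2}

# Expand the counts back into a sorted list of observations.
values = [v for v, c in sorted(counts.items()) for _ in range(c)]
n = len(values)

mean = sum(values) / n
variance = statistics.variance(values)   # sample variance (divides by n - 1)
std_dev = statistics.stdev(values)       # square root of the sample variance
zeros_pct = 100 * counts[0] / n

# 95th percentile with linear interpolation between neighbouring ranks.
pos = 0.95 * (n - 1)
lo = int(pos)
p95 = values[lo] + (pos - lo) * (values[lo + 1] - values[lo])

print(n, sum(values))
print(mean, variance, std_dev)
print(zeros_pct, p95)
```

The results agree with the reported count (192), sum (122), mean, variance, standard deviation, share of zeros, and 95th percentile to the precision shown in the report.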

파일명
Text

UNIQUE 

Distinct: 192
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.6 KiB

Length

Max length: 14
Median length: 14
Mean length: 14
Min length: 14

Characters and Unicode

Total characters: 2688
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 192
Unique (%): 100.0%

Sample

1st row: 000000017021_0
2nd row: 000000017451_0
3rd row: 000000017452_0
4th row: 000000017453_0
5th row: 000000017453_1
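
The sample rows suggest that each 파일명 is the 게시글ID zero-padded to 12 digits, followed by an underscore and the 파일순서. This is a pattern inferred from the samples, not a documented rule, and `make_file_key` is a hypothetical helper used only to illustrate it.

```python
def make_file_key(post_id: int, file_order: int) -> str:
    """Build a 파일명-style key (hypothetical helper; pattern inferred from samples)."""
    # 12-digit zero-padded 게시글ID, an underscore, then 파일순서.
    return f"{post_id:012d}_{file_order}"


sample = make_file_key(17021, 0)
print(sample, len(sample))  # prints: 000000017021_0 14
```

With a single-digit 파일순서 every key is 14 characters long, so 192 rows give 192 × 14 = 2688 characters, matching the total reported for this variable.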
Value | Count | Frequency (%)
000000017021_0 | 1 | 0.5%
000000017451_0 | 1 | 0.5%
000000017633_1 | 1 | 0.5%
000000017627_1 | 1 | 0.5%
000000017627_2 | 1 | 0.5%
000000017627_3 | 1 | 0.5%
000000017628_0 | 1 | 0.5%
000000017629_0 | 1 | 0.5%
000000017630_0 | 1 | 0.5%
000000017631_0 | 1 | 0.5%
Other values (182) | 182 | 94.8%

Most occurring characters

Value | Count | Frequency (%)
0 | 1511 | 56.2%
1 | 255 | 9.5%
7 | 233 | 8.7%
_ | 192 | 7.1%
6 | 150 | 5.6%
5 | 142 | 5.3%
3 | 50 | 1.9%
2 | 44 | 1.6%
4 | 43 | 1.6%
8 | 36 | 1.3%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 2496 | 92.9%
Connector Punctuation | 192 | 7.1%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 1511 | 60.5%
1 | 255 | 10.2%
7 | 233 | 9.3%
6 | 150 | 6.0%
5 | 142 | 5.7%
3 | 50 | 2.0%
2 | 44 | 1.8%
4 | 43 | 1.7%
8 | 36 | 1.4%
9 | 32 | 1.3%

Connector Punctuation
Value | Count | Frequency (%)
_ | 192 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 2688 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
0 | 1511 | 56.2%
1 | 255 | 9.5%
7 | 233 | 8.7%
_ | 192 | 7.1%
6 | 150 | 5.6%
5 | 142 | 5.3%
3 | 50 | 1.9%
2 | 44 | 1.6%
4 | 43 | 1.6%
8 | 36 | 1.3%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 2688 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
0 | 1511 | 56.2%
1 | 255 | 9.5%
7 | 233 | 8.7%
_ | 192 | 7.1%
6 | 150 | 5.6%
5 | 142 | 5.3%
3 | 50 | 1.9%
2 | 44 | 1.6%
4 | 43 | 1.6%
8 | 36 | 1.3%

원본파일명
Text

Distinct: 190
Distinct (%): 99.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.6 KiB

Length

Max length: 58
Median length: 43
Mean length: 23.848958
Min length: 8

Characters and Unicode

Total characters: 4579
Distinct characters: 430
Distinct categories: 12
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 188
Unique (%): 97.9%

Sample

1st row: 이웃집도서관엿보기(12).pdf
2nd row: [최종보고서] 대학도서관 창의협력 학습 환경 구축 연구_최종 (5).pdf
3rd row: (19-23)발전계획 분석 이슈리포트(최종본).pdf
4th row: 2019년 학술정보공유 사서지원단 운영협의회 회의자료_(1차_190328).hwp
5th row: 2019년 학술정보공유 사서지원단 운영협의회 회의록_0328.hwp
Value | Count | Frequency (%)
협상회의록 | 16 | 2.4%
협상 | 11 | 1.6%
회의록 | 9 | 1.3%
keris | 8 | 1.2%
 | 7 | 1.0%
bentham | 7 | 1.0%
science | 7 | 1.0%
정현희).hwp | 6 | 0.9%
계속구독품목 | 6 | 0.9%
대학라이선스 | 6 | 0.9%
Other values (476) | 591 | 87.7%

Most occurring characters

Value | Count | Frequency (%)
(space) | 482 | 10.5%
. | 216 | 4.7%
p | 176 | 3.8%
_ | 119 | 2.6%
 | 91 | 2.0%
g | 90 | 2.0%
j | 88 | 1.9%
2 | 87 | 1.9%
 | 77 | 1.7%
1 | 77 | 1.7%
Other values (420) | 3076 | 67.2%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2008 | 43.9%
Lowercase Letter | 918 | 20.0%
Space Separator | 482 | 10.5%
Decimal Number | 352 | 7.7%
Uppercase Letter | 297 | 6.5%
Other Punctuation | 226 | 4.9%
Connector Punctuation | 119 | 2.6%
Open Punctuation | 81 | 1.8%
Close Punctuation | 80 | 1.7%
Dash Punctuation | 11 | 0.2%
Other values (2) | 5 | 0.1%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
 | 91 | 4.5%
 | 77 | 3.8%
 | 75 | 3.7%
 | 70 | 3.5%
 | 67 | 3.3%
 | 60 | 3.0%
 | 49 | 2.4%
 | 33 | 1.6%
 | 30 | 1.5%
 | 26 | 1.3%
Other values (348) | 1430 | 71.2%

Lowercase Letter
Value | Count | Frequency (%)
p | 176 | 19.2%
g | 90 | 9.8%
j | 88 | 9.6%
h | 70 | 7.6%
w | 60 | 6.5%
l | 48 | 5.2%
e | 47 | 5.1%
x | 38 | 4.1%
i | 37 | 4.0%
f | 32 | 3.5%
Other values (15) | 232 | 25.3%

Uppercase Letter
Value | Count | Frequency (%)
R | 34 | 11.4%
S | 34 | 11.4%
L | 26 | 8.8%
E | 26 | 8.8%
I | 20 | 6.7%
O | 16 | 5.4%
K | 15 | 5.1%
M | 15 | 5.1%
P | 15 | 5.1%
B | 14 | 4.7%
Other values (14) | 82 | 27.6%

Decimal Number
Value | Count | Frequency (%)
2 | 87 | 24.7%
1 | 77 | 21.9%
0 | 72 | 20.5%
3 | 27 | 7.7%
9 | 20 | 5.7%
4 | 18 | 5.1%
6 | 14 | 4.0%
5 | 14 | 4.0%
7 | 12 | 3.4%
8 | 11 | 3.1%

Other Punctuation
Value | Count | Frequency (%)
. | 216 | 95.6%
, | 5 | 2.2%
& | 4 | 1.8%
! | 1 | 0.4%

Open Punctuation
Value | Count | Frequency (%)
( | 73 | 90.1%
[ | 8 | 9.9%

Close Punctuation
Value | Count | Frequency (%)
) | 72 | 90.0%
] | 8 | 10.0%

Space Separator
Value | Count | Frequency (%)
(space) | 482 | 100.0%

Connector Punctuation
Value | Count | Frequency (%)
_ | 119 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 11 | 100.0%

Other Symbol
Value | Count | Frequency (%)
 | 4 | 100.0%

Math Symbol
Value | Count | Frequency (%)
+ | 1 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 2008 | 43.9%
Common | 1356 | 29.6%
Latin | 1215 | 26.5%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
 | 91 | 4.5%
 | 77 | 3.8%
 | 75 | 3.7%
 | 70 | 3.5%
 | 67 | 3.3%
 | 60 | 3.0%
 | 49 | 2.4%
 | 33 | 1.6%
 | 30 | 1.5%
 | 26 | 1.3%
Other values (348) | 1430 | 71.2%

Latin
Value | Count | Frequency (%)
p | 176 | 14.5%
g | 90 | 7.4%
j | 88 | 7.2%
h | 70 | 5.8%
w | 60 | 4.9%
l | 48 | 4.0%
e | 47 | 3.9%
x | 38 | 3.1%
i | 37 | 3.0%
R | 34 | 2.8%
Other values (39) | 527 | 43.4%

Common
Value | Count | Frequency (%)
(space) | 482 | 35.5%
. | 216 | 15.9%
_ | 119 | 8.8%
2 | 87 | 6.4%
1 | 77 | 5.7%
( | 73 | 5.4%
) | 72 | 5.3%
0 | 72 | 5.3%
3 | 27 | 2.0%
9 | 20 | 1.5%
Other values (13) | 111 | 8.2%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 2567 | 56.1%
Hangul | 2008 | 43.9%
Misc Symbols | 4 | 0.1%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
(space) | 482 | 18.8%
. | 216 | 8.4%
p | 176 | 6.9%
_ | 119 | 4.6%
g | 90 | 3.5%
j | 88 | 3.4%
2 | 87 | 3.4%
1 | 77 | 3.0%
( | 73 | 2.8%
) | 72 | 2.8%
Other values (61) | 1087 | 42.3%

Hangul
Value | Count | Frequency (%)
 | 91 | 4.5%
 | 77 | 3.8%
 | 75 | 3.7%
 | 70 | 3.5%
 | 67 | 3.3%
 | 60 | 3.0%
 | 49 | 2.4%
 | 33 | 1.6%
 | 30 | 1.5%
 | 26 | 1.3%
Other values (348) | 1430 | 71.2%

Misc Symbols
Value | Count | Frequency (%)
 | 4 | 100.0%

파일사이즈
Real number (ℝ)

Distinct: 174
Distinct (%): 90.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 958680.76
Minimum: 9132
Maximum: 46414527
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.8 KiB

Quantile statistics

Minimum: 9132
5th percentile: 16384
Q1: 67456
Median: 130511.5
Q3: 239511.5
95th percentile: 3364531.6
Maximum: 46414527
Range: 46405395
Interquartile range (IQR): 172055.5

Descriptive statistics

Standard deviation: 4232458.3
Coefficient of variation (CV): 4.4148777
Kurtosis: 74.726211
Mean: 958680.76
Median Absolute Deviation (MAD): 85767
Skewness: 7.9561883
Sum: 1.8406671 × 10⁸
Variance: 1.7913703 × 10¹³
Monotonicity: Not monotonic
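
The large CV and skewness reflect a long right tail: a few very large files pull the mean far above the median. The sketch below quantifies this from the reported figures. Treating 파일사이즈 as a byte count is an assumption, since the report does not state the unit.

```python
# Figures reported for 파일사이즈.
mean = 958680.76
median = 130511.5
std_dev = 4232458.3
maximum = 46414527

cv = std_dev / mean              # coefficient of variation (unitless)
mean_to_median = mean / median   # well above 1 signals a long right tail

# Assumption: 파일사이즈 is measured in bytes (the report does not say).
max_mib = maximum / 1024**2

print(f"CV = {cv:.4f}")
print(f"mean / median = {mean_to_median:.2f}")
print(f"largest file = {max_mib:.2f} MiB (if the unit is bytes)")
```

For a column this skewed, the median and IQR describe a typical file better than the mean and standard deviation do.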
Histogram with fixed size bins (bins=50)
Common Values
Value | Count | Frequency (%)
32768 | 5 | 2.6%
16896 | 4 | 2.1%
16384 | 4 | 2.1%
15872 | 3 | 1.6%
75264 | 2 | 1.0%
14848 | 2 | 1.0%
18735563 | 2 | 1.0%
92160 | 2 | 1.0%
97280 | 2 | 1.0%
57344 | 2 | 1.0%
Other values (164) | 164 | 85.4%

Minimum 10 values
Value | Count | Frequency (%)
9132 | 1 | 0.5%
12800 | 1 | 0.5%
14336 | 1 | 0.5%
14848 | 2 | 1.0%
15603 | 1 | 0.5%
15872 | 3 | 1.6%
16384 | 4 | 2.1%
16896 | 4 | 2.1%
17104 | 1 | 0.5%
17920 | 1 | 0.5%

Maximum 10 values
Value | Count | Frequency (%)
46414527 | 1 | 0.5%
19825905 | 1 | 0.5%
18735563 | 2 | 1.0%
12918814 | 1 | 0.5%
8164357 | 1 | 0.5%
4569600 | 1 | 0.5%
4490752 | 1 | 0.5%
3826526 | 1 | 0.5%
3392404 | 1 | 0.5%
3341727 | 1 | 0.5%

다운로드횟수
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.6 KiB

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value | Count | Frequency (%)
0 | 192 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
0 | 192 | 100.0%

파일 BLOG
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 192
Missing (%): 100.0%
Memory size: 1.8 KiB

Interactions

[Figures: nine interaction plots, not reproduced here]

Correlations

 | 게시글ID | 파일순서 | 파일사이즈
게시글ID | 1.000 | 0.145 | 0.175
파일순서 | 0.145 | 1.000 | 0.223
파일사이즈 | 0.175 | 0.223 | 1.000

 | 게시글ID | 파일순서 | 파일사이즈
게시글ID | 1.000 | 0.251 | -0.442
파일순서 | 0.251 | 1.000 | -0.135
파일사이즈 | -0.442 | -0.135 | 1.000

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows
Index | 게시글ID | 파일순서 | 파일명 | 원본파일명 | 파일사이즈 | 다운로드횟수 | 파일 BLOG
0 | 17021 | 0 | 000000017021_0 | 이웃집도서관엿보기(12).pdf | 46414527 | 0 | <NA>
1 | 17451 | 0 | 000000017451_0 | [최종보고서] 대학도서관 창의협력 학습 환경 구축 연구_최종 (5).pdf | 19825905 | 0 | <NA>
2 | 17452 | 0 | 000000017452_0 | (19-23)발전계획 분석 이슈리포트(최종본).pdf | 1911425 | 0 | <NA>
3 | 17453 | 0 | 000000017453_0 | 2019년 학술정보공유 사서지원단 운영협의회 회의자료_(1차_190328).hwp | 1580032 | 0 | <NA>
4 | 17453 | 1 | 000000017453_1 | 2019년 학술정보공유 사서지원단 운영협의회 회의록_0328.hwp | 96768 | 0 | <NA>
5 | 17453 | 3 | 000000017453_3 | 2019년 학술정보공유 사서지원단 운영협의회 회의록_0715.hwp | 97280 | 0 | <NA>
6 | 17453 | 4 | 000000017453_4 | 2019년 학술정보공유 사서지원단 운영협의회 회의자료(2차_190715).hwp | 100864 | 0 | <NA>
7 | 17468 | 0 | 000000017468_0 | 좀 이상하지만 재미있는 녀석들.jpg | 110673 | 0 | <NA>
8 | 17470 | 0 | 000000017470_0 | 나의 기억을 보라.jpg | 230482 | 0 | <NA>
9 | 17472 | 0 | 000000017472_0 | 슬픔의 위로.jpg | 100136 | 0 | <NA>

Last rows
Index | 게시글ID | 파일순서 | 파일명 | 원본파일명 | 파일사이즈 | 다운로드횟수 | 파일 BLOG
182 | 17667 | 0 | 000000017667_0 | JoVE_2차 협상회의록.hwp | 17920 | 0 | <NA>
183 | 17668 | 0 | 000000017668_0 | 협상회의록 1차(8.27.).hwp | 32768 | 0 | <NA>
184 | 17668 | 1 | 000000017668_1 | 협상회의록 2차 (9.17.).hwp | 32768 | 0 | <NA>
185 | 17669 | 0 | 000000017669_0 | 21년 인문학 강화 독후감 공모전 공둥 주관기관 수요조사 협조 요청 공문_1684_200916.pdf | 89623 | 0 | <NA>
186 | 17669 | 1 | 000000017669_1 | 붙임1. 21년 독후감 공모전 공동 주관기관 수요조사 계획.pdf | 196768 | 0 | <NA>
187 | 17669 | 2 | 000000017669_2 | 붙임2. 수요조사지.hwp | 12800 | 0 | <NA>
188 | 17670 | 0 | 000000017670_0 | 협상회의록_Oxford English Dictionary(OED)_3차.hwp | 29184 | 0 | <NA>
189 | 17670 | 1 | 000000017670_1 | Oxford_Reference_Title List_403종.xlsx | 73589 | 0 | <NA>
190 | 17670 | 2 | 000000017670_2 | OXFORD REFERENCE TURNAWAY July 2019 to July 2020 (2).xlsx | 56080 | 0 | <NA>
191 | 17671 | 0 | 000000017671_0 | 2020_KERIS_종합목록입력지침교육-단행본.pdf | 2626568 | 0 | <NA>