Overview

Dataset statistics

Number of variables: 3
Number of observations: 36
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 1.0 KiB
Average record size in memory: 28.7 B

Variable types

Numeric: 1
Text: 1
Categorical: 1

Dataset

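Description: Administrative data for the e-History Museum system for victims of the Japanese military "comfort women" system. It is a catalogue of academic source collections, listing the number and title of each item.
Author: 여성가족부 (Ministry of Gender Equality and Family)
URL: https://www.data.go.kr/data/15065342/fileData.do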

Alerts

데이터 기준일 (data reference date) has constant value "2021-08-19" (Constant)
번호 (number) has unique values (Unique)
제목 (title) has unique values (Unique)
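The two alert types listed here can be illustrated with a minimal sketch. It assumes a column is "Constant" when it holds a single distinct value and "Unique" when every value is distinct; the function name `column_alerts` is hypothetical and is not part of ydata-profiling, whose actual alert logic may differ. The rows are copied from the report's sample table.

```python
from typing import Dict, List


def column_alerts(rows: List[dict]) -> Dict[str, str]:
    """Label each column 'Constant', 'Unique', or '' (no alert)."""
    alerts: Dict[str, str] = {}
    columns = list(rows[0].keys()) if rows else []
    for col in columns:
        values = [row[col] for row in rows]
        distinct = len(set(values))
        if distinct == 1:
            alerts[col] = "Constant"   # one value repeated in every row
        elif distinct == len(values):
            alerts[col] = "Unique"     # no value occurs twice
        else:
            alerts[col] = ""
    return alerts


# Three rows taken from the sample shown in this report.
sample_rows = [
    {"번호": 1, "제목": "2001 해외거주 일본군위안부 실태조사", "데이터 기준일": "2021-08-19"},
    {"번호": 2, "제목": "2001 일본군위안부 증언 통계 자료집", "데이터 기준일": "2021-08-19"},
    {"번호": 3, "제목": "2002 강제동원기 기업위안부에 관한 연구", "데이터 기준일": "2021-08-19"},
]

alerts = column_alerts(sample_rows)
print(alerts)
```

On these three rows the sketch flags the same columns as the report does for the full 36 rows.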

Reproduction

Analysis started: 2023-12-12 22:24:23.744638
Analysis finished: 2023-12-12 22:24:24.148765
Duration: 0.4 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

번호 (number)
Real number (ℝ)

UNIQUE 

Distinct: 36
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 18.5
Minimum: 1
Maximum: 36
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 456.0 B

Quantile statistics

Minimum: 1
5th percentile: 2.75
Q1: 9.75
Median: 18.5
Q3: 27.25
95th percentile: 34.25
Maximum: 36
Range: 35
Interquartile range (IQR): 17.5
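The quantile figures above are consistent with linear interpolation between neighbouring ranks over the values 1 to 36. The following is a minimal sketch of that calculation using only the standard library; the helper `quantile` is hypothetical, and linear interpolation is an assumption inferred from the numbers rather than something the report states.

```python
import math
from typing import Sequence


def quantile(sorted_values: Sequence[float], q: float) -> float:
    """Quantile by linear interpolation between the two nearest ranks."""
    pos = (len(sorted_values) - 1) * q   # fractional zero-based position
    lower = math.floor(pos)
    upper = math.ceil(pos)
    frac = pos - lower
    return sorted_values[lower] * (1 - frac) + sorted_values[upper] * frac


numbers = list(range(1, 37))  # the 36 values of the 번호 column, already sorted

q05, q1, med, q3, q95 = (quantile(numbers, q) for q in (0.05, 0.25, 0.5, 0.75, 0.95))
iqr = q3 - q1
value_range = numbers[-1] - numbers[0]

print(q05, q1, med, q3, q95)
print("IQR:", iqr, "Range:", value_range)
```

Up to floating-point rounding, the results agree with the 5th percentile, Q1, median, Q3, 95th percentile, IQR, and range listed in this section.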

Descriptive statistics

Standard deviation: 10.535654
Coefficient of variation (CV): 0.5694948
Kurtosis: -1.2
Mean: 18.5
Median Absolute Deviation (MAD): 9
Skewness: 0
Sum: 666
Variance: 111
Monotonicity: Strictly increasing
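Most of these descriptive statistics can be re-derived from the values 1 to 36 with the standard library. The sketch below assumes the sample (n - 1) form of the variance, which is what reproduces the reported 111; the population form would give a smaller number. Variable names are illustrative only.

```python
import statistics

numbers = list(range(1, 37))  # the 36 values of the 번호 column

total = sum(numbers)
mean = statistics.mean(numbers)
variance = statistics.variance(numbers)   # sample variance, divides by n - 1
std_dev = statistics.stdev(numbers)       # square root of the sample variance
cv = std_dev / mean                       # coefficient of variation
median = statistics.median(numbers)
# Median absolute deviation: median distance of each value from the median.
mad = statistics.median(abs(x - median) for x in numbers)
# Strictly increasing: every value is larger than the one before it.
strictly_increasing = all(a < b for a, b in zip(numbers, numbers[1:]))

print(total, mean, variance, round(std_dev, 6), round(cv, 7), mad, strictly_increasing)
```

Skewness is 0 because the values are symmetric around the mean of 18.5.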
Figure: Histogram with fixed size bins (bins=36)
Value | Count | Frequency (%)
1 | 1 | 2.8%
20 | 1 | 2.8%
22 | 1 | 2.8%
23 | 1 | 2.8%
24 | 1 | 2.8%
25 | 1 | 2.8%
26 | 1 | 2.8%
27 | 1 | 2.8%
28 | 1 | 2.8%
29 | 1 | 2.8%
Other values (26) | 26 | 72.2%

Value | Count | Frequency (%)
1 | 1 | 2.8%
2 | 1 | 2.8%
3 | 1 | 2.8%
4 | 1 | 2.8%
5 | 1 | 2.8%
6 | 1 | 2.8%
7 | 1 | 2.8%
8 | 1 | 2.8%
9 | 1 | 2.8%
10 | 1 | 2.8%

Value | Count | Frequency (%)
36 | 1 | 2.8%
35 | 1 | 2.8%
34 | 1 | 2.8%
33 | 1 | 2.8%
32 | 1 | 2.8%
31 | 1 | 2.8%
30 | 1 | 2.8%
29 | 1 | 2.8%
28 | 1 | 2.8%
27 | 1 | 2.8%

제목 (title)
Text

UNIQUE 

Distinct: 36
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 420.0 B

Length

Max length: 70
Median length: 39.5
Mean length: 33.722222
Min length: 18

Characters and Unicode

Total characters: 1214
Distinct characters: 181
Distinct categories: 10
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
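As a small illustration of these character properties, the sketch below counts Unicode general categories in the first sample title using Python's `unicodedata` module. The module returns two-letter abbreviations (`Lo`, `Nd`, `Zs`) rather than the long names used in this report (Other Letter, Decimal Number, Space Separator). Script and block properties are not exposed by the standard library, so they are left out here.

```python
import unicodedata
from collections import Counter

# First sample row of the 제목 column in this report.
title = "2001 해외거주 일본군위안부 실태조사"

# General category of every character: Lo = Other Letter,
# Nd = Decimal Number, Zs = Space Separator.
category_counts = Counter(unicodedata.category(ch) for ch in title)

for category, count in category_counts.most_common():
    print(category, count, f"{100 * count / len(title):.1f}%")
```

Applying the same count to all 36 titles would yield a per-category breakdown of the kind the report tabulates.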

Unique

Unique: 36
Unique (%): 100.0%

Sample

1st row: 2001 해외거주 일본군위안부 실태조사
2nd row: 2001 일본군위안부 동원.운영의 강제성에 대한 자료수집 및 분석정리
3rd row: 2001 일본군위안부 증언 통계 자료집
4th row: 2002 일본군위안부 문제에 대한 기업책임(선박, 상선회사)
5th row: 2002 강제동원기 기업위안부에 관한 연구
Value | Count | Frequency (%)
일본군위안부 | 11 | 4.8%
(token not preserved) | 10 | 4.4%
관련 | 9 | 3.9%
2002 | 8 | 3.5%
(token not preserved) | 7 | 3.1%
대한 | 5 | 2.2%
문제에 | 5 | 2.2%
일본군'위안부'문제 | 4 | 1.8%
종합보고서 | 4 | 1.8%
발굴정리해제사업 | 4 | 1.8%
Other values (122) | 161 | 70.6%

Most occurring characters

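Value | Count | Frequency (%)
(space) | 193 | 15.9%
0 | 43 | 3.5%
(character not preserved) | 40 | 3.3%
(character not preserved) | 39 | 3.2%
(character not preserved) | 38 | 3.1%
(character not preserved) | 37 | 3.0%
(character not preserved) | 37 | 3.0%
2 | 32 | 2.6%
(character not preserved) | 32 | 2.6%
(character not preserved) | 28 | 2.3%
Other values (171) | 695 | 57.2%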

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 805 | 66.3%
Space Separator | 193 | 15.9%
Decimal Number | 97 | 8.0%
Lowercase Letter | 34 | 2.8%
Other Punctuation | 29 | 2.4%
Final Punctuation | 18 | 1.5%
Dash Punctuation | 11 | 0.9%
Close Punctuation | 10 | 0.8%
Open Punctuation | 10 | 0.8%
Uppercase Letter | 7 | 0.6%

Most frequent character per category

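Other Letter
Value | Count | Frequency (%)
(character not preserved) | 40 | 5.0%
(character not preserved) | 39 | 4.8%
(character not preserved) | 38 | 4.7%
(character not preserved) | 37 | 4.6%
(character not preserved) | 37 | 4.6%
(character not preserved) | 32 | 4.0%
(character not preserved) | 28 | 3.5%
(character not preserved) | 25 | 3.1%
(character not preserved) | 21 | 2.6%
(character not preserved) | 20 | 2.5%
Other values (134) | 488 | 60.6%

Lowercase Letter
Value | Count | Frequency (%)
e | 5 | 14.7%
o | 4 | 11.8%
r | 3 | 8.8%
a | 3 | 8.8%
t | 3 | 8.8%
h | 3 | 8.8%
m | 2 | 5.9%
f | 2 | 5.9%
i | 2 | 5.9%
n | 2 | 5.9%
Other values (5) | 5 | 14.7%

Decimal Number
Value | Count | Frequency (%)
0 | 43 | 44.3%
2 | 32 | 33.0%
1 | 10 | 10.3%
4 | 5 | 5.2%
3 | 4 | 4.1%
6 | 2 | 2.1%
5 | 1 | 1.0%

Other Punctuation
Value | Count | Frequency (%)
' | 20 | 69.0%
, | 4 | 13.8%
" | 2 | 6.9%
. | 2 | 6.9%
? | 1 | 3.4%

Uppercase Letter
Value | Count | Frequency (%)
T | 3 | 42.9%
C | 1 | 14.3%
W | 1 | 14.3%
M | 1 | 14.3%
J | 1 | 14.3%

Space Separator
Value | Count | Frequency (%)
(space) | 193 | 100.0%

Final Punctuation
Value | Count | Frequency (%)
’ | 18 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 11 | 100.0%

Close Punctuation
Value | Count | Frequency (%)
) | 10 | 100.0%

Open Punctuation
Value | Count | Frequency (%)
( | 10 | 100.0%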

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 805 | 66.3%
Common | 368 | 30.3%
Latin | 41 | 3.4%

Most frequent character per script

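Hangul
Value | Count | Frequency (%)
(character not preserved) | 40 | 5.0%
(character not preserved) | 39 | 4.8%
(character not preserved) | 38 | 4.7%
(character not preserved) | 37 | 4.6%
(character not preserved) | 37 | 4.6%
(character not preserved) | 32 | 4.0%
(character not preserved) | 28 | 3.5%
(character not preserved) | 25 | 3.1%
(character not preserved) | 21 | 2.6%
(character not preserved) | 20 | 2.5%
Other values (134) | 488 | 60.6%

Latin
Value | Count | Frequency (%)
e | 5 | 12.2%
o | 4 | 9.8%
r | 3 | 7.3%
a | 3 | 7.3%
t | 3 | 7.3%
h | 3 | 7.3%
T | 3 | 7.3%
m | 2 | 4.9%
f | 2 | 4.9%
i | 2 | 4.9%
Other values (10) | 11 | 26.8%

Common
Value | Count | Frequency (%)
(space) | 193 | 52.4%
0 | 43 | 11.7%
2 | 32 | 8.7%
' | 20 | 5.4%
’ | 18 | 4.9%
- | 11 | 3.0%
1 | 10 | 2.7%
) | 10 | 2.7%
( | 10 | 2.7%
4 | 5 | 1.4%
Other values (7) | 16 | 4.3%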

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 805 | 66.3%
ASCII | 391 | 32.2%
Punctuation | 18 | 1.5%

Most frequent character per block

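ASCII
Value | Count | Frequency (%)
(space) | 193 | 49.4%
0 | 43 | 11.0%
2 | 32 | 8.2%
' | 20 | 5.1%
- | 11 | 2.8%
1 | 10 | 2.6%
) | 10 | 2.6%
( | 10 | 2.6%
e | 5 | 1.3%
4 | 5 | 1.3%
Other values (26) | 52 | 13.3%

Hangul
Value | Count | Frequency (%)
(character not preserved) | 40 | 5.0%
(character not preserved) | 39 | 4.8%
(character not preserved) | 38 | 4.7%
(character not preserved) | 37 | 4.6%
(character not preserved) | 37 | 4.6%
(character not preserved) | 32 | 4.0%
(character not preserved) | 28 | 3.5%
(character not preserved) | 25 | 3.1%
(character not preserved) | 21 | 2.6%
(character not preserved) | 20 | 2.5%
Other values (134) | 488 | 60.6%

Punctuation
Value | Count | Frequency (%)
’ | 18 | 100.0%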

데이터 기준일 (data reference date)
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 2.8%
Missing: 0
Missing (%): 0.0%
Memory size: 420.0 B
2021-08-19: 36

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2021-08-19
2nd row: 2021-08-19
3rd row: 2021-08-19
4th row: 2021-08-19
5th row: 2021-08-19

Common Values

Value | Count | Frequency (%)
2021-08-19 | 36 | 100.0%

Length

Figure: Histogram of lengths of the category

Common Values (Plot)

Figure: Plot of common values
Value | Count | Frequency (%)
2021-08-19 | 36 | 100.0%

Interactions

Figure: Interactions plot

Correlations

Figure: Correlation heatmap

 | 번호 | 제목
번호 | 1.000 | 1.000
제목 | 1.000 | 1.000

Missing values

Figure: A simple visualization of nullity by column.
Figure: Nullity matrix, a data-dense display which lets you quickly pick out patterns in data completion.

Sample

Index | 번호 | 제목 | 데이터 기준일
0 | 1 | 2001 해외거주 일본군위안부 실태조사 | 2021-08-19
1 | 2 | 2001 일본군위안부 동원.운영의 강제성에 대한 자료수집 및 분석정리 | 2021-08-19
2 | 3 | 2001 일본군위안부 증언 통계 자료집 | 2021-08-19
3 | 4 | 2002 일본군위안부 문제에 대한 기업책임(선박, 상선회사) | 2021-08-19
4 | 5 | 2002 강제동원기 기업위안부에 관한 연구 | 2021-08-19
5 | 6 | 2002 미연방기록물보존소에서 발굴된 자료 | 2021-08-19
6 | 7 | 2002 (그 말을 어디다 다 할 꼬)피해자 증언 자료집 | 2021-08-19
7 | 8 | 2002 국외거주 일본군위안부 피해자 실태조사 | 2021-08-19
8 | 9 | 2002 일본군위안부 문제에 대한 국제사회동향 분석 | 2021-08-19
9 | 10 | 2002 위안부 관련 이해를 위한 기초입문 | 2021-08-19

Index | 번호 | 제목 | 데이터 기준일
26 | 27 | 일본군’위안부’기록물 발굴정리해제사업 일본군’위안부’발굴정리해제(일본국회, 상) - | 2021-08-19
27 | 28 | 일본군’위안부’기록물 발굴정리해제사업 일본군’위안부’발굴정리해제(일본국회, 하) - | 2021-08-19
28 | 29 | 일본군’위안부’기록물 발굴정리해제사업 일본군’위안부’발굴정리해제(일본) - | 2021-08-19
29 | 30 | 일본군’위안부’기록물 발굴정리해제사업 일본군’위안부’발굴정리해제(영어권) - | 2021-08-19
30 | 31 | 일본군’위안부’피해자 관련 여성가족부 보유자료 목록 | 2021-08-19
31 | 32 | 일본군'위안부'문제 관련 국내외사례조사 및 향후과제 종합보고서 - 연구성과 기초조사보고서 | 2021-08-19
32 | 33 | 일본군'위안부'문제 관련 국내외사례조사 및 향후과제 종합보고서 - 언론부문 기초조사보고서 | 2021-08-19
33 | 34 | 일본군'위안부'문제 관련 국내외사례조사 및 향후과제 종합보고서 - 일본정부의 일본군'위안부'인식과 정책 | 2021-08-19
34 | 35 | 일본군'위안부'문제 관련 국내외사례조사 및 향후과제 종합보고서 - 일본군'위안부'관련 종합연구 기초조사보고 | 2021-08-19
35 | 36 | (가칭) 국립 일본군'위안부'연구소 및 역사관 건립을 위한 연구 보고서 | 2021-08-19