Dataset statistics
Number of variables | 4 |
---|---|
Number of observations | 241 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 1 |
Duplicate rows (%) | 0.4% |
Total size in memory | 7.7 KiB |
Average record size in memory | 32.5 B |
Variable types
Text | 1 |
---|---|
Categorical | 3 |
Dataset
Description | 여성사전시관 인물연구 정보 서비스 정보를 제공합니다. (인물연구명, 인물연구실명, 등록일자, 데이터기준일자) |
---|---|
Author | 여성가족부 |
URL | https://www.data.go.kr/data/15085777/fileData.do |
데이터기준일자 has constant value "" | Constant |
Dataset has 1 (0.4%) duplicate rows | Duplicates |
인물연구실명 is highly overall correlated with 등록일자 | High correlation |
등록일자 is highly overall correlated with 인물연구실명 | High correlation |
Reproduction
Analysis started | 2023-12-12 01:44:13.651127 |
---|---|
Analysis finished | 2023-12-12 01:44:14.145921 |
Duration | 0.49 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
인물연구명
Text
Distinct | 240 |
---|---|
Distinct (%) | 99.6% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.0 KiB |
Length
Max length | 20 |
---|---|
Median length | 16 |
Mean length | 10.481328 |
Min length | 2 |
Characters and Unicode
Total characters | 2526 |
---|---|
Distinct characters | 428 |
Distinct categories | 7 ? |
Distinct scripts | 3 ? |
Distinct blocks | 5 ? |
Unique
Unique | 239 ? |
---|---|
Unique (%) | 99.2% |
Sample
1st row | 231. 평량(平亮)의 처 |
---|---|
2nd row | 가야 용녀(傭女) |
3rd row | 가야의 이뇌왕비(異腦王妃) |
4th row | 강경애(1906~1944) |
5th row | 강빈 |
Value | Count | Frequency (%) |
처 | 8 | 2.6% |
이씨 | 3 | 1.0% |
딸 | 3 | 1.0% |
원덕태후(元德太后 | 2 | 0.6% |
2 | 0.6% | |
신씨 | 2 | 0.6% |
金氏 | 2 | 0.6% |
정순왕후 | 2 | 0.6% |
아내 | 2 | 0.6% |
이숙희(李淑禧 | 1 | 0.3% |
Other values (286) | 286 |
Most occurring characters
Value | Count | Frequency (%) |
) | 201 | 8.0% |
( | 201 | 8.0% |
1 | 156 | 6.2% |
9 | 123 | 4.9% |
114 | 4.5% | |
~ | 66 | 2.6% |
8 | 47 | 1.9% |
후 | 44 | 1.7% |
0 | 44 | 1.7% |
왕 | 38 | 1.5% |
Other values (418) | 1492 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 1439 | |
Decimal Number | 503 | 19.9% |
Close Punctuation | 201 | 8.0% |
Open Punctuation | 201 | 8.0% |
Space Separator | 114 | 4.5% |
Math Symbol | 66 | 2.6% |
Other Punctuation | 2 | 0.1% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
후 | 44 | 3.1% |
왕 | 38 | 2.6% |
씨 | 31 | 2.2% |
의 | 29 | 2.0% |
인 | 27 | 1.9% |
주 | 25 | 1.7% |
后 | 25 | 1.7% |
王 | 24 | 1.7% |
정 | 23 | 1.6% |
부 | 22 | 1.5% |
Other values (402) | 1151 |
Decimal Number
Value | Count | Frequency (%) |
1 | 156 | |
9 | 123 | |
8 | 47 | 9.3% |
0 | 44 | 8.7% |
2 | 33 | 6.6% |
3 | 26 | 5.2% |
7 | 21 | 4.2% |
4 | 19 | 3.8% |
5 | 19 | 3.8% |
6 | 15 | 3.0% |
Other Punctuation
Value | Count | Frequency (%) |
· | 1 | |
. | 1 |
Close Punctuation
Value | Count | Frequency (%) |
) | 201 |
Open Punctuation
Value | Count | Frequency (%) |
( | 201 |
Space Separator
Value | Count | Frequency (%) |
114 |
Math Symbol
Value | Count | Frequency (%) |
~ | 66 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 1087 | |
Hangul | 964 | |
Han | 475 |
Most frequent character per script
Han
Value | Count | Frequency (%) |
后 | 25 | 5.3% |
王 | 24 | 5.1% |
氏 | 20 | 4.2% |
主 | 15 | 3.2% |
人 | 14 | 2.9% |
夫 | 12 | 2.5% |
太 | 10 | 2.1% |
德 | 7 | 1.5% |
公 | 6 | 1.3% |
大 | 6 | 1.3% |
Other values (210) | 336 |
Hangul
Value | Count | Frequency (%) |
후 | 44 | 4.6% |
왕 | 38 | 3.9% |
씨 | 31 | 3.2% |
의 | 29 | 3.0% |
인 | 27 | 2.8% |
주 | 25 | 2.6% |
정 | 23 | 2.4% |
부 | 22 | 2.3% |
이 | 21 | 2.2% |
김 | 19 | 2.0% |
Other values (182) | 685 |
Common
Value | Count | Frequency (%) |
) | 201 | |
( | 201 | |
1 | 156 | |
9 | 123 | |
114 | ||
~ | 66 | 6.1% |
8 | 47 | 4.3% |
0 | 44 | 4.0% |
2 | 33 | 3.0% |
3 | 26 | 2.4% |
Other values (6) | 76 | 7.0% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1086 | |
Hangul | 964 | |
CJK | 457 | |
CJK Compat Ideographs | 18 | 0.7% |
None | 1 | < 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
) | 201 | |
( | 201 | |
1 | 156 | |
9 | 123 | |
114 | ||
~ | 66 | 6.1% |
8 | 47 | 4.3% |
0 | 44 | 4.1% |
2 | 33 | 3.0% |
3 | 26 | 2.4% |
Other values (5) | 75 | 6.9% |
Hangul
Value | Count | Frequency (%) |
후 | 44 | 4.6% |
왕 | 38 | 3.9% |
씨 | 31 | 3.2% |
의 | 29 | 3.0% |
인 | 27 | 2.8% |
주 | 25 | 2.6% |
정 | 23 | 2.4% |
부 | 22 | 2.3% |
이 | 21 | 2.2% |
김 | 19 | 2.0% |
Other values (182) | 685 |
CJK
Value | Count | Frequency (%) |
后 | 25 | 5.5% |
王 | 24 | 5.3% |
氏 | 20 | 4.4% |
主 | 15 | 3.3% |
人 | 14 | 3.1% |
夫 | 12 | 2.6% |
太 | 10 | 2.2% |
德 | 7 | 1.5% |
公 | 6 | 1.3% |
大 | 6 | 1.3% |
Other values (198) | 318 |
CJK Compat Ideographs
Value | Count | Frequency (%) |
金 | 5 | |
李 | 2 | 11.1% |
寧 | 2 | 11.1% |
烈 | 1 | 5.6% |
麗 | 1 | 5.6% |
廉 | 1 | 5.6% |
宅 | 1 | 5.6% |
蘭 | 1 | 5.6% |
林 | 1 | 5.6% |
樂 | 1 | 5.6% |
Other values (2) | 2 | 11.1% |
None
Value | Count | Frequency (%) |
· | 1 |
인물연구실명
Categorical
HIGH CORRELATION
 
Distinct | 5 |
---|---|
Distinct (%) | 2.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.0 KiB |
고려 | |
---|---|
고대 | |
조선 | |
일제강점기 | |
현대 |
Length
Max length | 5 |
---|---|
Median length | 2 |
Mean length | 2.5975104 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 고려 |
---|---|
2nd row | 고대 |
3rd row | 고대 |
4th row | 일제강점기 |
5th row | 조선 |
Common Values
Value | Count | Frequency (%) |
고려 | 59 | |
고대 | 58 | |
조선 | 57 | |
일제강점기 | 48 | |
현대 | 19 | 7.9% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
고려 | 59 | |
고대 | 58 | |
조선 | 57 | |
일제강점기 | 48 | |
현대 | 19 | 7.9% |
등록일자
Categorical
HIGH CORRELATION
 
Distinct | 2 |
---|---|
Distinct (%) | 0.8% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.0 KiB |
2019-09-09 | |
---|---|
2019-09-06 |
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2019-09-09 |
---|---|
2nd row | 2019-09-06 |
3rd row | 2019-09-06 |
4th row | 2019-09-09 |
5th row | 2019-09-09 |
Common Values
Value | Count | Frequency (%) |
2019-09-09 | 183 | |
2019-09-06 | 58 | 24.1% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2019-09-09 | 183 | |
2019-09-06 | 58 | 24.1% |
데이터기준일자
Categorical
CONSTANT
 
Distinct | 1 |
---|---|
Distinct (%) | 0.4% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 2.0 KiB |
2021-08-06 |
---|
Length
Max length | 10 |
---|---|
Median length | 10 |
Mean length | 10 |
Min length | 10 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 2021-08-06 |
---|---|
2nd row | 2021-08-06 |
3rd row | 2021-08-06 |
4th row | 2021-08-06 |
5th row | 2021-08-06 |
Common Values
Value | Count | Frequency (%) |
2021-08-06 | 241 |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
2021-08-06 | 241 |
인물연구실명 | 등록일자 | |
---|---|---|
인물연구실명 | 1.000 | 1.000 |
등록일자 | 1.000 | 1.000 |
인물연구실명 | 등록일자 | |
---|---|---|
인물연구실명 | 1.000 | 0.994 |
등록일자 | 0.994 | 1.000 |
인물연구실명 | 등록일자 | |
---|---|---|
인물연구실명 | 1.000 | 0.994 |
등록일자 | 0.994 | 1.000 |
인물연구명 | 인물연구실명 | 등록일자 | 데이터기준일자 | |
---|---|---|---|---|
0 | 231. 평량(平亮)의 처 | 고려 | 2019-09-09 | 2021-08-06 |
1 | 가야 용녀(傭女) | 고대 | 2019-09-06 | 2021-08-06 |
2 | 가야의 이뇌왕비(異腦王妃) | 고대 | 2019-09-06 | 2021-08-06 |
3 | 강경애(1906~1944) | 일제강점기 | 2019-09-09 | 2021-08-06 |
4 | 강빈 | 조선 | 2019-09-09 | 2021-08-06 |
5 | 강수(强首)의 처 | 고대 | 2019-09-06 | 2021-08-06 |
6 | 강신재(1924~2001) | 현대 | 2019-09-09 | 2021-08-06 |
7 | 강완숙(姜完淑) | 조선 | 2019-09-09 | 2021-08-06 |
8 | 강은교(1945~ ) | 현대 | 2019-09-09 | 2021-08-06 |
9 | 강정일당(姜靜一堂) | 조선 | 2019-09-09 | 2021-08-06 |
인물연구명 | 인물연구실명 | 등록일자 | 데이터기준일자 | |
---|---|---|---|---|
231 | 헌정왕후(獻貞王后) | 고려 | 2019-09-09 | 2021-08-06 |
232 | 현덕왕후 | 조선 | 2019-09-09 | 2021-08-06 |
233 | 현문혁(玄文弈)의 처 | 고려 | 2019-09-09 | 2021-08-06 |
234 | 혜명왕후 | 고대 | 2019-09-06 | 2021-08-06 |
235 | 화순옹주(和順翁主) | 조선 | 2019-09-09 | 2021-08-06 |
236 | 화완옹주(和緩翁主) | 조선 | 2019-09-09 | 2021-08-06 |
237 | 황애덕(1892~1971) | 일제강점기 | 2019-09-09 | 2021-08-06 |
238 | 황진이 | 조선 | 2019-09-09 | 2021-08-06 |
239 | 효녀 지은 | 고대 | 2019-09-06 | 2021-08-06 |
240 | 희명(希明) | 고대 | 2019-09-06 | 2021-08-06 |
Most frequently occurring
인물연구명 | 인물연구실명 | 등록일자 | 데이터기준일자 | # duplicates | |
---|---|---|---|---|---|
0 | 원덕태후(元德太后) | 고려 | 2019-09-09 | 2021-08-06 | 2 |