Dataset statistics
Number of variables | 3 |
---|---|
Number of observations | 1000 |
Missing cells | 1001 |
Missing cells (%) | 33.4% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 25.5 KiB |
Average record size in memory | 26.1 B |
Variable types
Numeric | 1 |
---|---|
Text | 1 |
Unsupported | 1 |
Dataset
Description | 현대한국구술자료관 구술자료와 관련된 연혁 정보 |
---|---|
Author | 한국학중앙연구원 |
URL | https://www.data.go.kr/data/15049074/fileData.do |
Unnamed: 2 has 1000 (100.0%) missing values | Missing |
번호 has unique values | Unique |
Unnamed: 2 is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
Reproduction
Analysis started | 2023-12-12 15:33:17.551482 |
---|---|
Analysis finished | 2023-12-12 15:33:17.935238 |
Duration | 0.38 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
번호
Real number (ℝ)
UNIQUE
 
Distinct | 1000 |
---|---|
Distinct (%) | 100.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1103.335 |
Minimum | 277 |
---|---|
Maximum | 1760 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 8.9 KiB |
Quantile statistics
Minimum | 277 |
---|---|
5-th percentile | 593.85 |
Q1 | 799.75 |
median | 1141.5 |
Q3 | 1423.25 |
95-th percentile | 1623.05 |
Maximum | 1760 |
Range | 1483 |
Interquartile range (IQR) | 623.5 |
Descriptive statistics
Standard deviation | 365.84425 |
---|---|
Coefficient of variation (CV) | 0.33158039 |
Kurtosis | -0.96915083 |
Mean | 1103.335 |
Median Absolute Deviation (MAD) | 312 |
Skewness | -0.15897445 |
Sum | 1103335 |
Variance | 133842.01 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1639 | 1 | 0.1% |
904 | 1 | 0.1% |
891 | 1 | 0.1% |
892 | 1 | 0.1% |
893 | 1 | 0.1% |
894 | 1 | 0.1% |
895 | 1 | 0.1% |
896 | 1 | 0.1% |
897 | 1 | 0.1% |
898 | 1 | 0.1% |
Other values (990) | 990 |
Value | Count | Frequency (%) |
277 | 1 | |
278 | 1 | |
279 | 1 | |
280 | 1 | |
281 | 1 | |
282 | 1 | |
283 | 1 | |
284 | 1 | |
285 | 1 | |
286 | 1 |
Value | Count | Frequency (%) |
1760 | 1 | |
1759 | 1 | |
1758 | 1 | |
1757 | 1 | |
1756 | 1 | |
1755 | 1 | |
1754 | 1 | |
1753 | 1 | |
1752 | 1 | |
1751 | 1 |
연도
Text
Distinct | 591 |
---|---|
Distinct (%) | 59.2% |
Missing | 1 |
Missing (%) | 0.1% |
Memory size | 7.9 KiB |
Length
Max length | 32 |
---|---|
Median length | 22 |
Mean length | 7.4574575 |
Min length | 1 |
Characters and Unicode
Total characters | 7450 |
---|---|
Distinct characters | 40 |
Distinct categories | 11 ? |
Distinct scripts | 3 ? |
Distinct blocks | 4 ? |
Unique
Unique | 446 ? |
---|---|
Unique (%) | 44.6% |
Sample
1st row | 1972~1979 |
---|---|
2nd row | 1983. 10~1989. 7 |
3rd row | 1989.10~ |
4th row | 1997~ |
5th row | 1997~ |
Value | Count | Frequency (%) |
368 | 23.8% | |
1973 | 23 | 1.5% |
1988 | 22 | 1.4% |
2008 | 22 | 1.4% |
1991 | 20 | 1.3% |
1997 | 20 | 1.3% |
1990 | 19 | 1.2% |
1974 | 19 | 1.2% |
1956 | 18 | 1.2% |
1979 | 17 | 1.1% |
Other values (413) | 1000 |
Most occurring characters
Value | Count | Frequency (%) |
9 | 1249 | |
1 | 1114 | |
0 | 991 | |
597 | ||
2 | 570 | |
- | 472 | 6.3% |
. | 449 | 6.0% |
8 | 360 | 4.8% |
7 | 337 | 4.5% |
6 | 298 | 4.0% |
Other values (30) | 1013 |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 5520 | |
Space Separator | 671 | 9.0% |
Other Punctuation | 523 | 7.0% |
Dash Punctuation | 472 | 6.3% |
Other Letter | 148 | 2.0% |
Math Symbol | 89 | 1.2% |
Lowercase Letter | 20 | 0.3% |
Open Punctuation | 2 | < 0.1% |
Close Punctuation | 2 | < 0.1% |
Uppercase Letter | 2 | < 0.1% |
Most frequent character per category
Decimal Number
Value | Count | Frequency (%) |
9 | 1249 | |
1 | 1114 | |
0 | 991 | |
2 | 570 | |
8 | 360 | 6.5% |
7 | 337 | 6.1% |
6 | 298 | 5.4% |
5 | 227 | 4.1% |
3 | 220 | 4.0% |
4 | 154 | 2.8% |
Other Letter
Value | Count | Frequency (%) |
년 | 66 | |
현 | 39 | |
재 | 37 | |
대 | 2 | 1.4% |
동 | 1 | 0.7% |
활 | 1 | 0.7% |
타 | 1 | 0.7% |
기 | 1 | 0.7% |
Lowercase Letter
Value | Count | Frequency (%) |
n | 5 | |
p | 4 | |
s | 4 | |
b | 4 | |
a | 2 | 10.0% |
y | 1 | 5.0% |
Other Punctuation
Value | Count | Frequency (%) |
. | 449 | |
/ | 65 | 12.4% |
; | 4 | 0.8% |
& | 4 | 0.8% |
, | 1 | 0.2% |
Math Symbol
Value | Count | Frequency (%) |
~ | 75 | |
∼ | 10 | 11.2% |
~ | 4 | 4.5% |
Space Separator
Value | Count | Frequency (%) |
597 | ||
74 | 11.0% |
Uppercase Letter
Value | Count | Frequency (%) |
J | 1 | |
M | 1 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 472 |
Open Punctuation
Value | Count | Frequency (%) |
( | 2 |
Close Punctuation
Value | Count | Frequency (%) |
) | 2 |
Control
Value | Count | Frequency (%) |
1 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 7280 | |
Hangul | 148 | 2.0% |
Latin | 22 | 0.3% |
Most frequent character per script
Common
Value | Count | Frequency (%) |
9 | 1249 | |
1 | 1114 | |
0 | 991 | |
597 | ||
2 | 570 | |
- | 472 | 6.5% |
. | 449 | 6.2% |
8 | 360 | 4.9% |
7 | 337 | 4.6% |
6 | 298 | 4.1% |
Other values (14) | 843 |
Hangul
Value | Count | Frequency (%) |
년 | 66 | |
현 | 39 | |
재 | 37 | |
대 | 2 | 1.4% |
동 | 1 | 0.7% |
활 | 1 | 0.7% |
타 | 1 | 0.7% |
기 | 1 | 0.7% |
Latin
Value | Count | Frequency (%) |
n | 5 | |
p | 4 | |
s | 4 | |
b | 4 | |
a | 2 | 9.1% |
J | 1 | 4.5% |
M | 1 | 4.5% |
y | 1 | 4.5% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 7214 | |
Hangul | 148 | 2.0% |
None | 78 | 1.0% |
Math Operators | 10 | 0.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
9 | 1249 | |
1 | 1114 | |
0 | 991 | |
597 | ||
2 | 570 | |
- | 472 | 6.5% |
. | 449 | 6.2% |
8 | 360 | 5.0% |
7 | 337 | 4.7% |
6 | 298 | 4.1% |
Other values (19) | 777 |
None
Value | Count | Frequency (%) |
74 | ||
~ | 4 | 5.1% |
Hangul
Value | Count | Frequency (%) |
년 | 66 | |
현 | 39 | |
재 | 37 | |
대 | 2 | 1.4% |
동 | 1 | 0.7% |
활 | 1 | 0.7% |
타 | 1 | 0.7% |
기 | 1 | 0.7% |
Math Operators
Value | Count | Frequency (%) |
∼ | 10 |
Unnamed: 2
Unsupported
MISSING
  REJECTED
  UNSUPPORTED
 
Missing | 1000 |
---|---|
Missing (%) | 100.0% |
Memory size | 8.9 KiB |
번호 | 연도 | Unnamed: 2 | |
---|---|---|---|
0 | 1639 | 1972~1979 | <NA> |
1 | 1640 | 1983. 10~1989. 7 | <NA> |
2 | 1641 | 1989.10~ | <NA> |
3 | 1642 | 1997~ | <NA> |
4 | 1643 | 1997~ | <NA> |
5 | 1750 | 1992-1995 | <NA> |
6 | 1751 | 1996-1997 | <NA> |
7 | 1752 | 1998-2000 | <NA> |
8 | 1753 | 1999-2003 | <NA> |
9 | 1754 | 2002 | <NA> |
번호 | 연도 | Unnamed: 2 | |
---|---|---|---|
990 | 599 | 1959-1964 | <NA> |
991 | 600 | 1962-1968 | <NA> |
992 | 601 | 1962-1966 | <NA> |
993 | 602 | 1965-1990 | <NA> |
994 | 603 | 1976-1979 | <NA> |
995 | 604 | 1982-현재 | <NA> |
996 | 588 | 1988 | <NA> |
997 | 589 | 1991 | <NA> |
998 | 590 | 1995-현재 | <NA> |
999 | 591 | 2008-현재 | <NA> |