Dataset statistics
Number of variables | 5 |
---|---|
Number of observations | 1066 |
Missing cells | 0 |
Missing cells (%) | 0.0% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 42.8 KiB |
Average record size in memory | 41.1 B |
Variable types
Text | 2 |
---|---|
Categorical | 1 |
Numeric | 1 |
DateTime | 1 |
Dataset
Description | 독립기념관 국외 독립운동사적지의 구분, 등록순번, 내용, 등록일자 등의 자료입니다. |
---|---|
Author | 독립기념관 |
URL | https://www.data.go.kr/data/15067832/fileData.do |
Reproduction
Analysis started | 2023-12-12 23:25:47.191887 |
---|---|
Analysis finished | 2023-12-12 23:25:47.706901 |
Duration | 0.52 seconds |
Software version | ydata-profiling vv4.5.1 |
Download configuration | config.json |
관리번호
Text
Distinct | 607 |
---|---|
Distinct (%) | 56.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 8.5 KiB |
Length
Max length | 12 |
---|---|
Median length | 7 |
Mean length | 7.3377111 |
Min length | 7 |
Characters and Unicode
Total characters | 7822 |
---|---|
Distinct characters | 33 |
Distinct categories | 3 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 366 ? |
---|---|
Unique (%) | 34.3% |
Sample
1st row | 1-01-13-0004 |
---|---|
2nd row | 1-04-33-0001 |
3rd row | 1-04-39-0001 |
4th row | 2-01-13-0004 |
5th row | 2-01-13-0005 |
Value | Count | Frequency (%) |
2-01-30-0001 | 19 | 1.8% |
2-01-30-0002 | 12 | 1.1% |
cn00400 | 8 | 0.8% |
ru00104 | 8 | 0.8% |
cn00013 | 8 | 0.8% |
cn00058 | 7 | 0.7% |
ru00105 | 7 | 0.7% |
cn00076 | 7 | 0.7% |
be00001 | 6 | 0.6% |
cn00124 | 6 | 0.6% |
Other values (597) | 978 |
Most occurring characters
Value | Count | Frequency (%) |
0 | 3101 | |
C | 601 | 7.7% |
N | 600 | 7.7% |
1 | 599 | 7.7% |
2 | 445 | 5.7% |
3 | 385 | 4.9% |
U | 247 | 3.2% |
4 | 227 | 2.9% |
- | 216 | 2.8% |
5 | 204 | 2.6% |
Other values (23) | 1197 | 15.3% |
Most occurring categories
Value | Count | Frequency (%) |
Decimal Number | 5618 | |
Uppercase Letter | 1988 | 25.4% |
Dash Punctuation | 216 | 2.8% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
C | 601 | |
N | 600 | |
U | 247 | |
S | 163 | 8.2% |
R | 84 | 4.2% |
P | 46 | 2.3% |
J | 45 | 2.3% |
M | 23 | 1.2% |
I | 22 | 1.1% |
D | 20 | 1.0% |
Other values (12) | 137 | 6.9% |
Decimal Number
Value | Count | Frequency (%) |
0 | 3101 | |
1 | 599 | 10.7% |
2 | 445 | 7.9% |
3 | 385 | 6.9% |
4 | 227 | 4.0% |
5 | 204 | 3.6% |
6 | 187 | 3.3% |
9 | 170 | 3.0% |
8 | 166 | 3.0% |
7 | 134 | 2.4% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 216 |
Most occurring scripts
Value | Count | Frequency (%) |
Common | 5834 | |
Latin | 1988 | 25.4% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
C | 601 | |
N | 600 | |
U | 247 | |
S | 163 | 8.2% |
R | 84 | 4.2% |
P | 46 | 2.3% |
J | 45 | 2.3% |
M | 23 | 1.2% |
I | 22 | 1.1% |
D | 20 | 1.0% |
Other values (12) | 137 | 6.9% |
Common
Value | Count | Frequency (%) |
0 | 3101 | |
1 | 599 | 10.3% |
2 | 445 | 7.6% |
3 | 385 | 6.6% |
4 | 227 | 3.9% |
- | 216 | 3.7% |
5 | 204 | 3.5% |
6 | 187 | 3.2% |
9 | 170 | 2.9% |
8 | 166 | 2.8% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 7822 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
0 | 3101 | |
C | 601 | 7.7% |
N | 600 | 7.7% |
1 | 599 | 7.7% |
2 | 445 | 5.7% |
3 | 385 | 4.9% |
U | 247 | 3.2% |
4 | 227 | 2.9% |
- | 216 | 2.8% |
5 | 204 | 2.6% |
Other values (23) | 1197 | 15.3% |
구분
Categorical
Distinct | 3 |
---|---|
Distinct (%) | 0.3% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 8.5 KiB |
인물 | |
---|---|
단체 | |
사건 | 47 |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 2 |
Min length | 2 |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | 단체 |
---|---|
2nd row | 인물 |
3rd row | 인물 |
4th row | 인물 |
5th row | 단체 |
Common Values
Value | Count | Frequency (%) |
인물 | 697 | |
단체 | 322 | |
사건 | 47 | 4.4% |
Length
Common Values (Plot)
Value | Count | Frequency (%) |
인물 | 697 | |
단체 | 322 | |
사건 | 47 | 4.4% |
등록순번
Real number (ℝ)
Distinct | 11 |
---|---|
Distinct (%) | 1.0% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 1.5375235 |
Minimum | 1 |
---|---|
Maximum | 11 |
Zeros | 0 |
Zeros (%) | 0.0% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 9.5 KiB |
Quantile statistics
Minimum | 1 |
---|---|
5-th percentile | 1 |
Q1 | 1 |
median | 1 |
Q3 | 2 |
95-th percentile | 4 |
Maximum | 11 |
Range | 10 |
Interquartile range (IQR) | 1 |
Descriptive statistics
Standard deviation | 1.1981986 |
---|---|
Coefficient of variation (CV) | 0.77930427 |
Kurtosis | 13.77072 |
Mean | 1.5375235 |
Median Absolute Deviation (MAD) | 0 |
Skewness | 3.3073387 |
Sum | 1639 |
Variance | 1.4356799 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
1 | 782 | |
2 | 150 | 14.1% |
3 | 67 | 6.3% |
4 | 29 | 2.7% |
5 | 15 | 1.4% |
6 | 9 | 0.8% |
7 | 7 | 0.7% |
8 | 4 | 0.4% |
9 | 1 | 0.1% |
10 | 1 | 0.1% |
Value | Count | Frequency (%) |
1 | 782 | |
2 | 150 | 14.1% |
3 | 67 | 6.3% |
4 | 29 | 2.7% |
5 | 15 | 1.4% |
6 | 9 | 0.8% |
7 | 7 | 0.7% |
8 | 4 | 0.4% |
9 | 1 | 0.1% |
10 | 1 | 0.1% |
Value | Count | Frequency (%) |
11 | 1 | 0.1% |
10 | 1 | 0.1% |
9 | 1 | 0.1% |
8 | 4 | 0.4% |
7 | 7 | 0.7% |
6 | 9 | 0.8% |
5 | 15 | 1.4% |
4 | 29 | 2.7% |
3 | 67 | |
2 | 150 |
내용
Text
Distinct | 425 |
---|---|
Distinct (%) | 39.9% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 8.5 KiB |
Value | Count | Frequency (%) |
대한민국임시정부 | 38 | 3.5% |
조선의용대(군 | 32 | 3.0% |
대한인국민회 | 27 | 2.5% |
한국광복군 | 26 | 2.4% |
이승만 | 23 | 2.1% |
김좌진 | 20 | 1.8% |
홍범도 | 18 | 1.7% |
3·1운동 | 16 | 1.5% |
신채호 | 15 | 1.4% |
서재필 | 14 | 1.3% |
Other values (400) | 855 |
Most occurring characters
Value | Count | Frequency (%) |
한 | 183 | 4.3% |
대 | 164 | 3.8% |
김 | 154 | 3.6% |
이 | 143 | 3.3% |
국 | 140 | 3.3% |
민 | 114 | 2.7% |
회 | 98 | 2.3% |
정 | 95 | 2.2% |
조 | 84 | 2.0% |
군 | 84 | 2.0% |
Other values (229) | 3023 |
Most occurring categories
Value | Count | Frequency (%) |
Other Letter | 4076 | |
Space Separator | 75 | 1.8% |
Decimal Number | 47 | 1.1% |
Open Punctuation | 32 | 0.7% |
Close Punctuation | 32 | 0.7% |
Other Punctuation | 20 | 0.5% |
Most frequent character per category
Other Letter
Value | Count | Frequency (%) |
한 | 183 | 4.5% |
대 | 164 | 4.0% |
김 | 154 | 3.8% |
이 | 143 | 3.5% |
국 | 140 | 3.4% |
민 | 114 | 2.8% |
회 | 98 | 2.4% |
정 | 95 | 2.3% |
조 | 84 | 2.1% |
군 | 84 | 2.1% |
Other values (219) | 2817 |
Decimal Number
Value | Count | Frequency (%) |
1 | 21 | |
3 | 17 | |
5 | 3 | 6.4% |
2 | 3 | 6.4% |
8 | 3 | 6.4% |
Other Punctuation
Value | Count | Frequency (%) |
· | 19 | |
. | 1 | 5.0% |
Space Separator
Value | Count | Frequency (%) |
75 |
Open Punctuation
Value | Count | Frequency (%) |
( | 32 |
Close Punctuation
Value | Count | Frequency (%) |
) | 32 |
Most occurring scripts
Value | Count | Frequency (%) |
Hangul | 4076 | |
Common | 206 | 4.8% |
Most frequent character per script
Hangul
Value | Count | Frequency (%) |
한 | 183 | 4.5% |
대 | 164 | 4.0% |
김 | 154 | 3.8% |
이 | 143 | 3.5% |
국 | 140 | 3.4% |
민 | 114 | 2.8% |
회 | 98 | 2.4% |
정 | 95 | 2.3% |
조 | 84 | 2.1% |
군 | 84 | 2.1% |
Other values (219) | 2817 |
Common
Value | Count | Frequency (%) |
75 | ||
( | 32 | |
) | 32 | |
1 | 21 | 10.2% |
· | 19 | 9.2% |
3 | 17 | 8.3% |
5 | 3 | 1.5% |
2 | 3 | 1.5% |
8 | 3 | 1.5% |
. | 1 | 0.5% |
Most occurring blocks
Value | Count | Frequency (%) |
Hangul | 4076 | |
ASCII | 187 | 4.4% |
None | 19 | 0.4% |
Most frequent character per block
Hangul
Value | Count | Frequency (%) |
한 | 183 | 4.5% |
대 | 164 | 4.0% |
김 | 154 | 3.8% |
이 | 143 | 3.5% |
국 | 140 | 3.4% |
민 | 114 | 2.8% |
회 | 98 | 2.4% |
정 | 95 | 2.3% |
조 | 84 | 2.1% |
군 | 84 | 2.1% |
Other values (219) | 2817 |
ASCII
Value | Count | Frequency (%) |
75 | ||
( | 32 | |
) | 32 | |
1 | 21 | 11.2% |
3 | 17 | 9.1% |
5 | 3 | 1.6% |
2 | 3 | 1.6% |
8 | 3 | 1.6% |
. | 1 | 0.5% |
None
Value | Count | Frequency (%) |
· | 19 |
등록일자
Date
Distinct | 453 |
---|---|
Distinct (%) | 42.5% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 8.5 KiB |
Minimum | 2015-12-16 19:01:00 |
---|---|
Maximum | 2020-08-03 14:44:00 |
구분 | 등록순번 | |
---|---|---|
구분 | 1.000 | 0.318 |
등록순번 | 0.318 | 1.000 |
등록순번 | 구분 | |
---|---|---|
등록순번 | 1.000 | 0.139 |
구분 | 0.139 | 1.000 |
관리번호 | 구분 | 등록순번 | 내용 | 등록일자 | |
---|---|---|---|---|---|
0 | 1-01-13-0004 | 단체 | 1 | 대한민국임시정부 | 2020-02-24 13:33 |
1 | 1-04-33-0001 | 인물 | 1 | 윤봉길 | 2018-07-06 14:24 |
2 | 1-04-39-0001 | 인물 | 1 | 윤봉길 | 2018-05-10 15:11 |
3 | 2-01-13-0004 | 인물 | 1 | 김시문 | 2019-12-31 13:45 |
4 | 2-01-13-0005 | 단체 | 1 | 흥사단 | 2019-12-31 14:05 |
5 | 2-01-13-0006 | 단체 | 1 | 대한민국임시정부 | 2019-12-31 13:44 |
6 | 2-01-13-0006 | 사건 | 1 | 3.1독립선언일 | 2019-12-31 13:44 |
7 | 2-01-13-0006 | 인물 | 1 | 이동녕 | 2019-12-31 13:44 |
8 | 2-01-13-0006 | 인물 | 2 | 안창호 | 2019-12-31 13:44 |
9 | 2-01-13-0007 | 인물 | 1 | 김성숙 | 2019-12-31 14:06 |
관리번호 | 구분 | 등록순번 | 내용 | 등록일자 | |
---|---|---|---|---|---|
1056 | US00140 | 인물 | 1 | 이승만 | 2019-12-02 16:06 |
1057 | US00143 | 단체 | 1 | 대한인국민회 | 2018-05-23 17:06 |
1058 | US00146 | 단체 | 1 | 대한인국민회 | 2018-05-23 17:36 |
1059 | US00147 | 단체 | 1 | 대한인국민회 | 2018-05-23 17:49 |
1060 | US00150 | 인물 | 1 | 김경 | 2018-05-16 11:02 |
1061 | UZ00001 | 인물 | 1 | 조명희 | 2018-02-06 16:22 |
1062 | UZ00003 | 인물 | 1 | 이인섭 | 2018-02-06 15:14 |
1063 | UZ00004 | 인물 | 1 | 이인섭 | 2018-02-06 15:12 |
1064 | UZ00005 | 인물 | 1 | 이인섭 | 2018-02-06 15:13 |
1065 | UZ00006 | 인물 | 1 | 김병화 | 2018-02-06 15:09 |