Overview

Dataset statistics

Number of variables5
Number of observations1066
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory42.8 KiB
Average record size in memory41.1 B

Variable types

Text2
Categorical1
Numeric1
DateTime1

Dataset

Description독립기념관 국외 독립운동사적지의 구분, 등록순번, 내용, 등록일자 등의 자료입니다.
Author독립기념관
URLhttps://www.data.go.kr/data/15067832/fileData.do

Reproduction

Analysis started2023-12-12 23:25:47.191887
Analysis finished2023-12-12 23:25:47.706901
Duration0.52 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct607
Distinct (%)56.9%
Missing0
Missing (%)0.0%
Memory size8.5 KiB
2023-12-13T08:25:47.951514image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length12
Median length7
Mean length7.3377111
Min length7

Characters and Unicode

Total characters7822
Distinct characters33
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique366 ?
Unique (%)34.3%

Sample

1st row1-01-13-0004
2nd row1-04-33-0001
3rd row1-04-39-0001
4th row2-01-13-0004
5th row2-01-13-0005
ValueCountFrequency (%)
2-01-30-0001 19
 
1.8%
2-01-30-0002 12
 
1.1%
cn00400 8
 
0.8%
ru00104 8
 
0.8%
cn00013 8
 
0.8%
cn00058 7
 
0.7%
ru00105 7
 
0.7%
cn00076 7
 
0.7%
be00001 6
 
0.6%
cn00124 6
 
0.6%
Other values (597) 978
91.7%
2023-12-13T08:25:48.358538image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 3101
39.6%
C 601
 
7.7%
N 600
 
7.7%
1 599
 
7.7%
2 445
 
5.7%
3 385
 
4.9%
U 247
 
3.2%
4 227
 
2.9%
- 216
 
2.8%
5 204
 
2.6%
Other values (23) 1197
 
15.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 5618
71.8%
Uppercase Letter 1988
 
25.4%
Dash Punctuation 216
 
2.8%

Most frequent character per category

Uppercase Letter
ValueCountFrequency (%)
C 601
30.2%
N 600
30.2%
U 247
12.4%
S 163
 
8.2%
R 84
 
4.2%
P 46
 
2.3%
J 45
 
2.3%
M 23
 
1.2%
I 22
 
1.1%
D 20
 
1.0%
Other values (12) 137
 
6.9%
Decimal Number
ValueCountFrequency (%)
0 3101
55.2%
1 599
 
10.7%
2 445
 
7.9%
3 385
 
6.9%
4 227
 
4.0%
5 204
 
3.6%
6 187
 
3.3%
9 170
 
3.0%
8 166
 
3.0%
7 134
 
2.4%
Dash Punctuation
ValueCountFrequency (%)
- 216
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 5834
74.6%
Latin 1988
 
25.4%

Most frequent character per script

Latin
ValueCountFrequency (%)
C 601
30.2%
N 600
30.2%
U 247
12.4%
S 163
 
8.2%
R 84
 
4.2%
P 46
 
2.3%
J 45
 
2.3%
M 23
 
1.2%
I 22
 
1.1%
D 20
 
1.0%
Other values (12) 137
 
6.9%
Common
ValueCountFrequency (%)
0 3101
53.2%
1 599
 
10.3%
2 445
 
7.6%
3 385
 
6.6%
4 227
 
3.9%
- 216
 
3.7%
5 204
 
3.5%
6 187
 
3.2%
9 170
 
2.9%
8 166
 
2.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 7822
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 3101
39.6%
C 601
 
7.7%
N 600
 
7.7%
1 599
 
7.7%
2 445
 
5.7%
3 385
 
4.9%
U 247
 
3.2%
4 227
 
2.9%
- 216
 
2.8%
5 204
 
2.6%
Other values (23) 1197
 
15.3%

구분
Categorical

Distinct3
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size8.5 KiB
인물
697 
단체
322 
사건
 
47

Length

Max length2
Median length2
Mean length2
Min length2

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row단체
2nd row인물
3rd row인물
4th row인물
5th row단체

Common Values

ValueCountFrequency (%)
인물 697
65.4%
단체 322
30.2%
사건 47
 
4.4%

Length

2023-12-13T08:25:48.508016image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-13T08:25:48.600184image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
인물 697
65.4%
단체 322
30.2%
사건 47
 
4.4%

등록순번
Real number (ℝ)

Distinct11
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.5375235
Minimum1
Maximum11
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size9.5 KiB
2023-12-13T08:25:48.682235image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q11
median1
Q32
95-th percentile4
Maximum11
Range10
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.1981986
Coefficient of variation (CV)0.77930427
Kurtosis13.77072
Mean1.5375235
Median Absolute Deviation (MAD)0
Skewness3.3073387
Sum1639
Variance1.4356799
MonotonicityNot monotonic
2023-12-13T08:25:48.781417image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=11)
ValueCountFrequency (%)
1 782
73.4%
2 150
 
14.1%
3 67
 
6.3%
4 29
 
2.7%
5 15
 
1.4%
6 9
 
0.8%
7 7
 
0.7%
8 4
 
0.4%
9 1
 
0.1%
10 1
 
0.1%
ValueCountFrequency (%)
1 782
73.4%
2 150
 
14.1%
3 67
 
6.3%
4 29
 
2.7%
5 15
 
1.4%
6 9
 
0.8%
7 7
 
0.7%
8 4
 
0.4%
9 1
 
0.1%
10 1
 
0.1%
ValueCountFrequency (%)
11 1
 
0.1%
10 1
 
0.1%
9 1
 
0.1%
8 4
 
0.4%
7 7
 
0.7%
6 9
 
0.8%
5 15
 
1.4%
4 29
 
2.7%
3 67
6.3%
2 150
14.1%

내용
Text

Distinct425
Distinct (%)39.9%
Missing0
Missing (%)0.0%
Memory size8.5 KiB
2023-12-13T08:25:49.033250image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length19
Median length3
Mean length4.0168856
Min length2

Characters and Unicode

Total characters4282
Distinct characters239
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique246 ?
Unique (%)23.1%

Sample

1st row대한민국임시정부
2nd row윤봉길
3rd row윤봉길
4th row김시문
5th row흥사단
ValueCountFrequency (%)
대한민국임시정부 38
 
3.5%
조선의용대(군 32
 
3.0%
대한인국민회 27
 
2.5%
한국광복군 26
 
2.4%
이승만 23
 
2.1%
김좌진 20
 
1.8%
홍범도 18
 
1.7%
3·1운동 16
 
1.5%
신채호 15
 
1.4%
서재필 14
 
1.3%
Other values (400) 855
78.9%
2023-12-13T08:25:49.466629image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
183
 
4.3%
164
 
3.8%
154
 
3.6%
143
 
3.3%
140
 
3.3%
114
 
2.7%
98
 
2.3%
95
 
2.2%
84
 
2.0%
84
 
2.0%
Other values (229) 3023
70.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4076
95.2%
Space Separator 75
 
1.8%
Decimal Number 47
 
1.1%
Open Punctuation 32
 
0.7%
Close Punctuation 32
 
0.7%
Other Punctuation 20
 
0.5%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
183
 
4.5%
164
 
4.0%
154
 
3.8%
143
 
3.5%
140
 
3.4%
114
 
2.8%
98
 
2.4%
95
 
2.3%
84
 
2.1%
84
 
2.1%
Other values (219) 2817
69.1%
Decimal Number
ValueCountFrequency (%)
1 21
44.7%
3 17
36.2%
5 3
 
6.4%
2 3
 
6.4%
8 3
 
6.4%
Other Punctuation
ValueCountFrequency (%)
· 19
95.0%
. 1
 
5.0%
Space Separator
ValueCountFrequency (%)
75
100.0%
Open Punctuation
ValueCountFrequency (%)
( 32
100.0%
Close Punctuation
ValueCountFrequency (%)
) 32
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4076
95.2%
Common 206
 
4.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
183
 
4.5%
164
 
4.0%
154
 
3.8%
143
 
3.5%
140
 
3.4%
114
 
2.8%
98
 
2.4%
95
 
2.3%
84
 
2.1%
84
 
2.1%
Other values (219) 2817
69.1%
Common
ValueCountFrequency (%)
75
36.4%
( 32
15.5%
) 32
15.5%
1 21
 
10.2%
· 19
 
9.2%
3 17
 
8.3%
5 3
 
1.5%
2 3
 
1.5%
8 3
 
1.5%
. 1
 
0.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4076
95.2%
ASCII 187
 
4.4%
None 19
 
0.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
183
 
4.5%
164
 
4.0%
154
 
3.8%
143
 
3.5%
140
 
3.4%
114
 
2.8%
98
 
2.4%
95
 
2.3%
84
 
2.1%
84
 
2.1%
Other values (219) 2817
69.1%
ASCII
ValueCountFrequency (%)
75
40.1%
( 32
17.1%
) 32
17.1%
1 21
 
11.2%
3 17
 
9.1%
5 3
 
1.6%
2 3
 
1.6%
8 3
 
1.6%
. 1
 
0.5%
None
ValueCountFrequency (%)
· 19
100.0%
Distinct453
Distinct (%)42.5%
Missing0
Missing (%)0.0%
Memory size8.5 KiB
Minimum2015-12-16 19:01:00
Maximum2020-08-03 14:44:00
2023-12-13T08:25:49.613482image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-13T08:25:49.764948image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2023-12-13T08:25:47.442178image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-13T08:25:49.846060image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
구분등록순번
구분1.0000.318
등록순번0.3181.000
2023-12-13T08:25:49.917031image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
등록순번구분
등록순번1.0000.139
구분0.1391.000

Missing values

2023-12-13T08:25:47.553483image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-13T08:25:47.649706image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

관리번호구분등록순번내용등록일자
01-01-13-0004단체1대한민국임시정부2020-02-24 13:33
11-04-33-0001인물1윤봉길2018-07-06 14:24
21-04-39-0001인물1윤봉길2018-05-10 15:11
32-01-13-0004인물1김시문2019-12-31 13:45
42-01-13-0005단체1흥사단2019-12-31 14:05
52-01-13-0006단체1대한민국임시정부2019-12-31 13:44
62-01-13-0006사건13.1독립선언일2019-12-31 13:44
72-01-13-0006인물1이동녕2019-12-31 13:44
82-01-13-0006인물2안창호2019-12-31 13:44
92-01-13-0007인물1김성숙2019-12-31 14:06
관리번호구분등록순번내용등록일자
1056US00140인물1이승만2019-12-02 16:06
1057US00143단체1대한인국민회2018-05-23 17:06
1058US00146단체1대한인국민회2018-05-23 17:36
1059US00147단체1대한인국민회2018-05-23 17:49
1060US00150인물1김경2018-05-16 11:02
1061UZ00001인물1조명희2018-02-06 16:22
1062UZ00003인물1이인섭2018-02-06 15:14
1063UZ00004인물1이인섭2018-02-06 15:12
1064UZ00005인물1이인섭2018-02-06 15:13
1065UZ00006인물1김병화2018-02-06 15:09