Overview

Dataset statistics

Number of variables5
Number of observations77
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.1 KiB
Average record size in memory41.7 B

Variable types

Categorical2
DateTime1
Text2

Dataset

Description한국전통문화대학교가 국내외 유관기관과 교류협력 체결한 현황을 정리한 자료입니다. 국내외구분, 체결일자, 체결기관명, 국가, 협정명의 순서로 구성되어 있습니다.
URLhttps://www.data.go.kr/data/15104184/fileData.do

Alerts

국내외구분 is highly overall correlated with 국가High correlation
국가 is highly overall correlated with 국내외구분High correlation
국가 is highly imbalanced (67.3%)Imbalance

Reproduction

Analysis started2023-12-12 11:21:23.263580
Analysis finished2023-12-12 11:21:24.148311
Duration0.88 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

국내외구분
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)2.6%
Missing0
Missing (%)0.0%
Memory size748.0 B
국내기관
65 
국외기관
12 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row국내기관
2nd row국내기관
3rd row국내기관
4th row국내기관
5th row국내기관

Common Values

ValueCountFrequency (%)
국내기관 65
84.4%
국외기관 12
 
15.6%

Length

2023-12-12T20:21:24.239896image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:21:24.370764image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
국내기관 65
84.4%
국외기관 12
 
15.6%
Distinct73
Distinct (%)94.8%
Missing0
Missing (%)0.0%
Memory size748.0 B
Minimum2004-12-21 00:00:00
Maximum2023-06-29 00:00:00
2023-12-12T20:21:24.555464image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:21:24.737654image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
Distinct72
Distinct (%)93.5%
Missing0
Missing (%)0.0%
Memory size748.0 B
2023-12-12T20:21:25.129415image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length25
Median length18
Mean length9.4155844
Min length3

Characters and Unicode

Total characters725
Distinct characters169
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique69 ?
Unique (%)89.6%

Sample

1st row(사)한국문화재기능인협회
2nd row㈜엔가드
3rd row국립부여문화재연구소
4th row국립중앙과학관
5th row국립부여박물관
ValueCountFrequency (%)
규슈국립박물관 3
 
3.2%
부여군 3
 
3.2%
전통건축수리기술진흥재단 2
 
2.2%
2
 
2.2%
세종특별본부 1
 
1.1%
유네스코한국위원회 1
 
1.1%
충청북도문화재연구원 1
 
1.1%
경기문화재연구원 1
 
1.1%
경기문화재단 1
 
1.1%
문화재연구원 1
 
1.1%
Other values (77) 77
82.8%
2023-12-12T20:21:25.716941image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
38
 
5.2%
31
 
4.3%
31
 
4.3%
27
 
3.7%
20
 
2.8%
17
 
2.3%
17
 
2.3%
16
 
2.2%
15
 
2.1%
15
 
2.1%
Other values (159) 498
68.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 670
92.4%
Space Separator 16
 
2.2%
Open Punctuation 10
 
1.4%
Close Punctuation 10
 
1.4%
Other Symbol 8
 
1.1%
Uppercase Letter 6
 
0.8%
Other Punctuation 4
 
0.6%
Dash Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
38
 
5.7%
31
 
4.6%
31
 
4.6%
27
 
4.0%
20
 
3.0%
17
 
2.5%
17
 
2.5%
15
 
2.2%
15
 
2.2%
14
 
2.1%
Other values (147) 445
66.4%
Uppercase Letter
ValueCountFrequency (%)
C 2
33.3%
M 1
16.7%
O 1
16.7%
R 1
16.7%
I 1
16.7%
Other Punctuation
ValueCountFrequency (%)
' 2
50.0%
· 2
50.0%
Space Separator
ValueCountFrequency (%)
16
100.0%
Open Punctuation
ValueCountFrequency (%)
( 10
100.0%
Close Punctuation
ValueCountFrequency (%)
) 10
100.0%
Other Symbol
ValueCountFrequency (%)
8
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 678
93.5%
Common 41
 
5.7%
Latin 6
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
38
 
5.6%
31
 
4.6%
31
 
4.6%
27
 
4.0%
20
 
2.9%
17
 
2.5%
17
 
2.5%
15
 
2.2%
15
 
2.2%
14
 
2.1%
Other values (148) 453
66.8%
Common
ValueCountFrequency (%)
16
39.0%
( 10
24.4%
) 10
24.4%
' 2
 
4.9%
· 2
 
4.9%
- 1
 
2.4%
Latin
ValueCountFrequency (%)
C 2
33.3%
M 1
16.7%
O 1
16.7%
R 1
16.7%
I 1
16.7%

Most occurring blocks

ValueCountFrequency (%)
Hangul 670
92.4%
ASCII 45
 
6.2%
None 10
 
1.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
38
 
5.7%
31
 
4.6%
31
 
4.6%
27
 
4.0%
20
 
3.0%
17
 
2.5%
17
 
2.5%
15
 
2.2%
15
 
2.2%
14
 
2.1%
Other values (147) 445
66.4%
ASCII
ValueCountFrequency (%)
16
35.6%
( 10
22.2%
) 10
22.2%
' 2
 
4.4%
C 2
 
4.4%
M 1
 
2.2%
O 1
 
2.2%
R 1
 
2.2%
I 1
 
2.2%
- 1
 
2.2%
None
ValueCountFrequency (%)
8
80.0%
· 2
 
20.0%

국가
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct8
Distinct (%)10.4%
Missing0
Missing (%)0.0%
Memory size748.0 B
한국
65 
일본
 
6
미국
 
1
러시아
 
1
중국
 
1
Other values (3)
 
3

Length

Max length7
Median length2
Mean length2.1038961
Min length2

Unique

Unique6 ?
Unique (%)7.8%

Sample

1st row한국
2nd row한국
3rd row한국
4th row한국
5th row한국

Common Values

ValueCountFrequency (%)
한국 65
84.4%
일본 6
 
7.8%
미국 1
 
1.3%
러시아 1
 
1.3%
중국 1
 
1.3%
이탈리아 1
 
1.3%
몽골 1
 
1.3%
사우디아라비아 1
 
1.3%

Length

2023-12-12T20:21:25.984084image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:21:26.220546image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
한국 65
84.4%
일본 6
 
7.8%
미국 1
 
1.3%
러시아 1
 
1.3%
중국 1
 
1.3%
이탈리아 1
 
1.3%
몽골 1
 
1.3%
사우디아라비아 1
 
1.3%
Distinct48
Distinct (%)62.3%
Missing0
Missing (%)0.0%
Memory size748.0 B
2023-12-12T20:21:26.552375image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length38
Median length29
Mean length12.363636
Min length4

Characters and Unicode

Total characters952
Distinct characters147
Distinct categories8 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique35 ?
Unique (%)45.5%

Sample

1st row산학협력 협약
2nd row산학협력
3rd row3차원 스캐닝시스템 공동운용 협약
4th row학술협력협정
5th row교류협력 협정
ValueCountFrequency (%)
업무협약 27
 
13.7%
위한 11
 
5.6%
교류협력 8
 
4.1%
협약 8
 
4.1%
산학협력 7
 
3.6%
협정 6
 
3.0%
6
 
3.0%
운영 5
 
2.5%
지원 5
 
2.5%
포함 5
 
2.5%
Other values (83) 109
55.3%
2023-12-12T20:21:27.120552image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
120
 
12.6%
106
 
11.1%
61
 
6.4%
36
 
3.8%
35
 
3.7%
33
 
3.5%
31
 
3.3%
30
 
3.2%
29
 
3.0%
19
 
2.0%
Other values (137) 452
47.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 794
83.4%
Space Separator 120
 
12.6%
Close Punctuation 10
 
1.1%
Open Punctuation 10
 
1.1%
Other Punctuation 8
 
0.8%
Uppercase Letter 8
 
0.8%
Dash Punctuation 1
 
0.1%
Decimal Number 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
106
 
13.4%
61
 
7.7%
36
 
4.5%
35
 
4.4%
33
 
4.2%
31
 
3.9%
30
 
3.8%
29
 
3.7%
19
 
2.4%
19
 
2.4%
Other values (122) 395
49.7%
Uppercase Letter
ValueCountFrequency (%)
O 2
25.0%
D 2
25.0%
M 1
12.5%
C 1
12.5%
K 1
12.5%
P 1
12.5%
Close Punctuation
ValueCountFrequency (%)
) 9
90.0%
1
 
10.0%
Open Punctuation
ValueCountFrequency (%)
( 9
90.0%
1
 
10.0%
Other Punctuation
ValueCountFrequency (%)
· 7
87.5%
, 1
 
12.5%
Space Separator
ValueCountFrequency (%)
120
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%
Decimal Number
ValueCountFrequency (%)
3 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 794
83.4%
Common 150
 
15.8%
Latin 8
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
106
 
13.4%
61
 
7.7%
36
 
4.5%
35
 
4.4%
33
 
4.2%
31
 
3.9%
30
 
3.8%
29
 
3.7%
19
 
2.4%
19
 
2.4%
Other values (122) 395
49.7%
Common
ValueCountFrequency (%)
120
80.0%
) 9
 
6.0%
( 9
 
6.0%
· 7
 
4.7%
- 1
 
0.7%
, 1
 
0.7%
3 1
 
0.7%
1
 
0.7%
1
 
0.7%
Latin
ValueCountFrequency (%)
O 2
25.0%
D 2
25.0%
M 1
12.5%
C 1
12.5%
K 1
12.5%
P 1
12.5%

Most occurring blocks

ValueCountFrequency (%)
Hangul 794
83.4%
ASCII 149
 
15.7%
None 9
 
0.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
120
80.5%
) 9
 
6.0%
( 9
 
6.0%
O 2
 
1.3%
D 2
 
1.3%
M 1
 
0.7%
C 1
 
0.7%
K 1
 
0.7%
- 1
 
0.7%
P 1
 
0.7%
Other values (2) 2
 
1.3%
Hangul
ValueCountFrequency (%)
106
 
13.4%
61
 
7.7%
36
 
4.5%
35
 
4.4%
33
 
4.2%
31
 
3.9%
30
 
3.8%
29
 
3.7%
19
 
2.4%
19
 
2.4%
Other values (122) 395
49.7%
None
ValueCountFrequency (%)
· 7
77.8%
1
 
11.1%
1
 
11.1%

Correlations

2023-12-12T20:21:27.300174image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
국내외구분체결일자체결기관명국가협정명
국내외구분1.0001.0001.0001.0001.000
체결일자1.0001.0000.9481.0000.998
체결기관명1.0000.9481.0001.0000.000
국가1.0001.0001.0001.0000.734
협정명1.0000.9980.0000.7341.000
2023-12-12T20:21:27.468354image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
국내외구분국가
국내외구분1.0000.959
국가0.9591.000
2023-12-12T20:21:27.616021image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
국내외구분국가
국내외구분1.0000.959
국가0.9591.000

Missing values

2023-12-12T20:21:23.960544image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T20:21:24.098333image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

국내외구분체결일자체결기관명국가협정명
0국내기관2004-12-21(사)한국문화재기능인협회한국산학협력 협약
1국내기관2005-07-17㈜엔가드한국산학협력
2국내기관2006-05-10국립부여문화재연구소한국3차원 스캐닝시스템 공동운용 협약
3국내기관2006-06-27국립중앙과학관한국학술협력협정
4국내기관2008-08-21국립부여박물관한국교류협력 협정
5국내기관2009-02-18㈜한켐한국산학협력 협정(장학금 지원 포함)
6국내기관2009-03-10㈜팜클한국산학협력 협정(장학금 지원 포함)
7국내기관2009-07-08한국교육과정평가원한국교류협력 협정서
8국내기관2010-04-01한국문화재재단한국산학협력 협정
9국내기관2010-04-09국립중앙박물관문화재단한국산학협력 협정
국내외구분체결일자체결기관명국가협정명
67국외기관2015-03-31규슈국립박물관일본학술문화교류협정(갱신)
68국외기관2020-03-31규슈국립박물관일본학술문화교류협정(갱신)
69국외기관2008-11-07나라문화재연구소일본학술문화교류협정(개정, 명칭변경)
70국외기관2010-02-10스미스소니언 협회미국학술교류협약서
71국외기관2010-03-23러시아과학원극동지소 역사학고고학민속학연구소러시아문화교류에 관한 협정서
72국외기관2012-05-24중국실크박물관중국전문역량강화를 위한 협력에 관한 협정
73국외기관2021-08-18입명관대학교 역사도시방재연구소일본학술교류협약체결
74국외기관2022-10-12국제문화재보존복구센터(ICCROM)이탈리아학술교류협약체결
75국외기관2023-03-07몽골 과학아카데미 고고학연구소몽골학술교류협약체결
76국외기관2023-03-15사우디아라비아 왕립전통예술원사우디아라비아학술교류협약체결