Overview

Dataset statistics

Number of variables: 6
Number of observations: 248
Missing cells: 496
Missing cells (%): 33.3%
Duplicate rows: 1
Duplicate rows (%): 0.4%
Total size in memory: 12.2 KiB
Average record size in memory: 50.5 B
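
The missing-cell percentage follows from the counts above: 496 missing cells out of 248 × 6 cells in total. A minimal Python sketch of that arithmetic (the variable names are illustrative and not part of the report):

```python
# Figures copied from the dataset statistics above.
n_variables = 6
n_observations = 248
missing_cells = 496

total_cells = n_variables * n_observations        # 1488 cells in total
missing_pct = 100 * missing_cells / total_cells   # share of empty cells

print(f"{missing_pct:.1f}%")  # 33.3%
```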

Variable types

Numeric: 2
Categorical: 2
Text: 1
DateTime: 1

Dataset

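Description: Provides the list of press releases published on the main website of the Korea Institute of Maritime and Fisheries Technology (한국해양수산연수원). Fields include 연번 (serial number), 구분 (category), 게시번호 (post number), 제목 (title), 작성자 (author), and 작성일 (date posted). Reference period: 2020 to August 2023.
Author: 한국해양수산연수원 (Korea Institute of Maritime and Fisheries Technology)
URL: https://www.data.go.kr/data/15121734/fileData.do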

Alerts

Dataset has 1 (0.4%) duplicate rows [Duplicates]
작성자 is highly overall correlated with 연번 and 2 other fields [High correlation]
구분 is highly overall correlated with 연번 and 2 other fields [High correlation]
연번 is highly overall correlated with 게시번호 and 2 other fields [High correlation]
게시번호 is highly overall correlated with 연번 and 2 other fields [High correlation]
연번 has 124 (50.0%) missing values [Missing]
게시번호 has 124 (50.0%) missing values [Missing]
제목 has 124 (50.0%) missing values [Missing]
작성일 has 124 (50.0%) missing values [Missing]
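
These alerts are consistent with a block of entirely empty rows: four columns each have exactly 124 missing values, and the duplicate-rows table at the end of the report shows an all-<NA> row. A minimal sketch of filtering such rows before analysis, in plain Python. The in-memory records and the `is_empty_row` helper are hypothetical and are not part of the report's pipeline:

```python
# Hypothetical records; None stands for a missing cell (<NA> in the report).
COLUMNS = ["연번", "구분", "게시번호", "제목", "작성자", "작성일"]

rows = [
    {"연번": 124, "구분": "보도자료", "게시번호": 6651,
     "제목": "미국 해사청장, 한국해양수산연수원 및 APEC SEN 방문",
     "작성자": "신정윤", "작성일": "2023-08-16"},
    {name: None for name in COLUMNS},  # entirely empty row
    {name: None for name in COLUMNS},  # another entirely empty row
]

def is_empty_row(row):
    """Return True when every cell in the row is missing."""
    return all(row[name] is None for name in COLUMNS)

cleaned = [row for row in rows if not is_empty_row(row)]
print(len(rows), "->", len(cleaned))  # 3 -> 1
```

With pandas, `DataFrame.dropna(how="all")` applies the same kind of filter to a data frame.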

Reproduction

Analysis started: 2023-12-12 14:54:25.158904
Analysis finished: 2023-12-12 14:54:26.256653
Duration: 1.1 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

연번
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct: 124
Distinct (%): 100.0%
Missing: 124
Missing (%): 50.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 62.5
Minimum: 1
Maximum: 124
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.3 KiB

Quantile statistics

Minimum: 1
5-th percentile: 7.15
Q1: 31.75
Median: 62.5
Q3: 93.25
95-th percentile: 117.85
Maximum: 124
Range: 123
Interquartile range (IQR): 61.5
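
These quantiles match what linear interpolation gives for the integers 1 to 124, which is what the Minimum, Maximum, and Distinct figures suggest for the non-missing 연번 values. A minimal check with Python's standard library. That the column is exactly 1..124 is an inference from those figures, not something the report states:

```python
import statistics

# Assumed non-missing values of 연번: the integers 1..124.
values = list(range(1, 125))

# method="inclusive" interpolates linearly between the minimum and maximum.
q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
ventiles = statistics.quantiles(values, n=20, method="inclusive")
p5, p95 = ventiles[0], ventiles[-1]

iqr = q3 - q1
print(p5, q1, median, q3, p95, iqr)
```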

Descriptive statistics

Standard deviation: 35.939764
Coefficient of variation (CV): 0.57503623
Kurtosis: -1.2
Mean: 62.5
Median Absolute Deviation (MAD): 31
Skewness: 0
Sum: 7750
Variance: 1291.6667
Monotonicity: Strictly decreasing
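
The descriptive statistics can be reproduced the same way. The sketch below again assumes the non-missing 연번 values are the integers 1..124, and uses the sample (n - 1) variance, which is the convention that matches the reported figures:

```python
import statistics

# Assumed non-missing values of 연번: the integers 1..124.
values = list(range(1, 125))

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance, divides by n - 1
std = statistics.stdev(values)           # square root of the sample variance
cv = std / mean                          # coefficient of variation

# Median absolute deviation: median distance from the median.
med = statistics.median(values)
mad = statistics.median([abs(v - med) for v in values])

print(total, mean, round(variance, 4), round(std, 6), round(cv, 8), mad)
```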
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
45 | 1 | 0.4%
32 | 1 | 0.4%
33 | 1 | 0.4%
34 | 1 | 0.4%
35 | 1 | 0.4%
36 | 1 | 0.4%
37 | 1 | 0.4%
38 | 1 | 0.4%
39 | 1 | 0.4%
40 | 1 | 0.4%
Other values (114) | 114 | 46.0%
(Missing) | 124 | 50.0%

Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | 0.4%
2 | 1 | 0.4%
3 | 1 | 0.4%
4 | 1 | 0.4%
5 | 1 | 0.4%
6 | 1 | 0.4%
7 | 1 | 0.4%
8 | 1 | 0.4%
9 | 1 | 0.4%
10 | 1 | 0.4%

Maximum 10 values
Value | Count | Frequency (%)
124 | 1 | 0.4%
123 | 1 | 0.4%
122 | 1 | 0.4%
121 | 1 | 0.4%
120 | 1 | 0.4%
119 | 1 | 0.4%
118 | 1 | 0.4%
117 | 1 | 0.4%
116 | 1 | 0.4%
115 | 1 | 0.4%

구분
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): 0.8%
Missing: 0
Missing (%): 0.0%
Memory size: 2.1 KiB
보도자료: 124
<NA>: 124

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 보도자료
2nd row: 보도자료
3rd row: 보도자료
4th row: 보도자료
5th row: 보도자료

Common Values

Value | Count | Frequency (%)
보도자료 | 124 | 50.0%
<NA> | 124 | 50.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
보도자료 | 124 | 50.0%
na | 124 | 50.0%

게시번호
Real number (ℝ)

HIGH CORRELATION  MISSING 

Distinct: 124
Distinct (%): 100.0%
Missing: 124
Missing (%): 50.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2670.1048
Minimum: 1022
Maximum: 6651
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.3 KiB

Quantile statistics

Minimum: 1022
5-th percentile: 1028.15
Q1: 1052.75
Median: 1083.5
Q3: 6554.75
95-th percentile: 6627.4
Maximum: 6651
Range: 5629
Interquartile range (IQR): 5502

Descriptive statistics

Standard deviation: 2519.1134
Coefficient of variation (CV): 0.94345111
Kurtosis: -1.1433684
Mean: 2670.1048
Median Absolute Deviation (MAD): 36
Skewness: 0.93521474
Sum: 331093
Variance: 6345932.2
Monotonicity: Not monotonic
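
Several of the 게시번호 figures are derived from others in the same tables, so they can be cross-checked with simple arithmetic. The sketch below uses only numbers reported above; the variable names are illustrative. The large gap between the median (1083.5) and the mean (about 2670.1) suggests the values fall into two widely separated ranges, which agrees with the minimum and maximum value lists.

```python
# Figures copied from the 게시번호 statistics above.
minimum, maximum = 1022, 6651
q1, q3 = 1052.75, 6554.75
mean, std = 2670.1048, 2519.1134
total, count = 331093, 124   # Sum and number of non-missing values

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range
cv = std / mean                   # Coefficient of variation
mean_from_sum = total / count     # Mean recomputed from Sum

print(value_range, iqr, round(cv, 4), round(mean_from_sum, 4))
```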
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
1066 | 1 | 0.4%
1052 | 1 | 0.4%
1054 | 1 | 0.4%
1055 | 1 | 0.4%
1056 | 1 | 0.4%
1057 | 1 | 0.4%
1058 | 1 | 0.4%
1059 | 1 | 0.4%
1060 | 1 | 0.4%
1061 | 1 | 0.4%
Other values (114) | 114 | 46.0%
(Missing) | 124 | 50.0%

Minimum 10 values
Value | Count | Frequency (%)
1022 | 1 | 0.4%
1023 | 1 | 0.4%
1024 | 1 | 0.4%
1025 | 1 | 0.4%
1026 | 1 | 0.4%
1027 | 1 | 0.4%
1028 | 1 | 0.4%
1029 | 1 | 0.4%
1030 | 1 | 0.4%
1031 | 1 | 0.4%

Maximum 10 values
Value | Count | Frequency (%)
6651 | 1 | 0.4%
6650 | 1 | 0.4%
6640 | 1 | 0.4%
6636 | 1 | 0.4%
6631 | 1 | 0.4%
6630 | 1 | 0.4%
6628 | 1 | 0.4%
6624 | 1 | 0.4%
6623 | 1 | 0.4%
6622 | 1 | 0.4%

제목
Text

MISSING 

Distinct: 124
Distinct (%): 100.0%
Missing: 124
Missing (%): 50.0%
Memory size: 2.1 KiB

Length

Max length: 61
Median length: 43
Mean length: 31.201613
Min length: 12

Characters and Unicode

Total characters: 3869
Distinct characters: 362
Distinct categories: 11
Distinct scripts: 4
Distinct blocks: 6
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 124
Unique (%): 100.0%

Sample

1st row: 미국 해사청장, 한국해양수산연수원 및 APEC SEN 방문
2nd row: 해사산업 발전을 위한 전문가 인적교류 증진 추진
3rd row: 한국해양수산연수원, 대국민 참여 혁신 아이디어 공모전 개최
4th row: 한국해양수산연수원, 환경보호 위한 ‘1회용품 제로 챌린지’ 동참
5th row: 한국해양수산연수원, 2023년 ODA 국제승선실습(GOBT) 프로그램 수료식 개최
Value | Count | Frequency (%)
한국해양수산연수원 | 75 | 10.1%
개최 | 26 | 3.5%
위한 | 14 | 1.9%
오션폴리텍 | 11 | 1.5%
체결 | 10 | 1.3%
해기사 | 8 | 1.1%
교육생 | 8 | 1.1%
글로벌 | 8 | 1.1%
시행 | 8 | 1.1%
모집 | 8 | 1.1%
Other values (414) | 569 | 76.4%
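
A word-frequency table like the one above can be sketched with `collections.Counter`. The example runs on the five sample titles only, and the tokenizer (split on whitespace, then trim punctuation from both ends) is an assumption, since the report does not state how ydata-profiling tokenizes text:

```python
import string
from collections import Counter

# The five sample titles shown in the report.
titles = [
    "미국 해사청장, 한국해양수산연수원 및 APEC SEN 방문",
    "해사산업 발전을 위한 전문가 인적교류 증진 추진",
    "한국해양수산연수원, 대국민 참여 혁신 아이디어 공모전 개최",
    "한국해양수산연수원, 환경보호 위한 ‘1회용품 제로 챌린지’ 동참",
    "한국해양수산연수원, 2023년 ODA 국제승선실습(GOBT) 프로그램 수료식 개최",
]

# Assumed tokenizer: whitespace split, then trim punctuation at both ends.
TRIM = string.punctuation + "‘’“”"

def tokenize(title):
    tokens = (word.strip(TRIM) for word in title.split())
    return [token for token in tokens if token]

word_counts = Counter(token for title in titles for token in tokenize(title))
print(word_counts.most_common(3))
```

Trimming only at the ends leaves interior punctuation in place, so a token such as 국제승선실습(GOBT) is not split further.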

Most occurring characters

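Value | Count | Frequency (%)
(space) | 627 | 16.2%
(unreadable) | 184 | 4.8%
(unreadable) | 132 | 3.4%
(unreadable) | 114 | 2.9%
(unreadable) | 107 | 2.8%
(unreadable) | 100 | 2.6%
(unreadable) | 100 | 2.6%
(unreadable) | 94 | 2.4%
(unreadable) | 83 | 2.1%
, | 82 | 2.1%
Other values (352) | 2246 | 58.1%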

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 2814 | 72.7%
Space Separator | 627 | 16.2%
Uppercase Letter | 134 | 3.5%
Other Punctuation | 101 | 2.6%
Decimal Number | 100 | 2.6%
Open Punctuation | 25 | 0.6%
Close Punctuation | 25 | 0.6%
Initial Punctuation | 17 | 0.4%
Final Punctuation | 17 | 0.4%
Lowercase Letter | 8 | 0.2%
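
The category names above are Unicode general categories. A minimal sketch of counting them for one of the sample titles with Python's `unicodedata` module. The `NAMES` mapping from two-letter codes to the labels used in this report is written by hand for illustration and covers only the codes that occur in this title:

```python
import unicodedata
from collections import Counter

# Fifth sample title; normalized to NFC so each Hangul syllable is one code point.
title = unicodedata.normalize(
    "NFC", "한국해양수산연수원, 2023년 ODA 국제승선실습(GOBT) 프로그램 수료식 개최"
)

# Hand-written mapping from general-category codes to the report's labels.
NAMES = {
    "Lo": "Other Letter",
    "Lu": "Uppercase Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Po": "Other Punctuation",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
}

def category_name(ch):
    code = unicodedata.category(ch)
    return NAMES.get(code, code)  # fall back to the raw code if unmapped

category_counts = Counter(category_name(ch) for ch in title)
for name, count in category_counts.most_common():
    print(name, count)
```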

Most frequent character per category

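Other Letter
Value | Count | Frequency (%)
(unreadable) | 184 | 6.5%
(unreadable) | 132 | 4.7%
(unreadable) | 114 | 4.1%
(unreadable) | 107 | 3.8%
(unreadable) | 100 | 3.6%
(unreadable) | 100 | 3.6%
(unreadable) | 94 | 3.3%
(unreadable) | 83 | 2.9%
(unreadable) | 43 | 1.5%
(unreadable) | 40 | 1.4%
Other values (297) | 1817 | 64.6%

Uppercase Letter
Value | Count | Frequency (%)
E | 17 | 12.7%
O | 13 | 9.7%
S | 12 | 9.0%
M | 10 | 7.5%
N | 9 | 6.7%
A | 9 | 6.7%
G | 9 | 6.7%
T | 8 | 6.0%
P | 7 | 5.2%
C | 6 | 4.5%
Other values (10) | 34 | 25.4%

Decimal Number
Value | Count | Frequency (%)
2 | 35 | 35.0%
0 | 23 | 23.0%
3 | 14 | 14.0%
1 | 12 | 12.0%
5 | 7 | 7.0%
4 | 3 | 3.0%
7 | 2 | 2.0%
9 | 2 | 2.0%
8 | 1 | 1.0%
6 | 1 | 1.0%

Lowercase Letter
Value | Count | Frequency (%)
e | 2 | 25.0%
c | 1 | 12.5%
n | 1 | 12.5%
l | 1 | 12.5%
o | 1 | 12.5%
i | 1 | 12.5%
v | 1 | 12.5%

Other Punctuation
Value | Count | Frequency (%)
, | 82 | 81.2%
· | 7 | 6.9%
" | 6 | 5.9%
! | 4 | 4.0%
. | 1 | 1.0%
# | 1 | 1.0%

Open Punctuation
Value | Count | Frequency (%)
( | 14 | 56.0%
(unreadable) | 10 | 40.0%
(unreadable) | 1 | 4.0%

Close Punctuation
Value | Count | Frequency (%)
) | 14 | 56.0%
(unreadable) | 10 | 40.0%
(unreadable) | 1 | 4.0%

Initial Punctuation
Value | Count | Frequency (%)
(unreadable) | 15 | 88.2%
(unreadable) | 2 | 11.8%

Final Punctuation
Value | Count | Frequency (%)
(unreadable) | 15 | 88.2%
(unreadable) | 2 | 11.8%

Space Separator
Value | Count | Frequency (%)
(space) | 627 | 100.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 1 | 100.0%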

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 2812 | 72.7%
Common | 913 | 23.6%
Latin | 142 | 3.7%
Han | 2 | 0.1%

Most frequent character per script

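Hangul
Value | Count | Frequency (%)
(unreadable) | 184 | 6.5%
(unreadable) | 132 | 4.7%
(unreadable) | 114 | 4.1%
(unreadable) | 107 | 3.8%
(unreadable) | 100 | 3.6%
(unreadable) | 100 | 3.6%
(unreadable) | 94 | 3.3%
(unreadable) | 83 | 3.0%
(unreadable) | 43 | 1.5%
(unreadable) | 40 | 1.4%
Other values (295) | 1815 | 64.5%

Common
Value | Count | Frequency (%)
(space) | 627 | 68.7%
, | 82 | 9.0%
2 | 35 | 3.8%
0 | 23 | 2.5%
(unreadable) | 15 | 1.6%
(unreadable) | 15 | 1.6%
( | 14 | 1.5%
) | 14 | 1.5%
3 | 14 | 1.5%
1 | 12 | 1.3%
Other values (18) | 62 | 6.8%

Latin
Value | Count | Frequency (%)
E | 17 | 12.0%
O | 13 | 9.2%
S | 12 | 8.5%
M | 10 | 7.0%
N | 9 | 6.3%
A | 9 | 6.3%
G | 9 | 6.3%
T | 8 | 5.6%
P | 7 | 4.9%
C | 6 | 4.2%
Other values (17) | 42 | 29.6%

Han
Value | Count | Frequency (%)
(unreadable) | 1 | 50.0%
(unreadable) | 1 | 50.0%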

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 2811 | 72.7%
ASCII | 992 | 25.6%
Punctuation | 34 | 0.9%
None | 29 | 0.7%
CJK | 2 | 0.1%
Compat Jamo | 1 | < 0.1%

Most frequent character per block

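ASCII
Value | Count | Frequency (%)
(space) | 627 | 63.2%
, | 82 | 8.3%
2 | 35 | 3.5%
0 | 23 | 2.3%
E | 17 | 1.7%
( | 14 | 1.4%
) | 14 | 1.4%
3 | 14 | 1.4%
O | 13 | 1.3%
S | 12 | 1.2%
Other values (36) | 141 | 14.2%

Hangul
Value | Count | Frequency (%)
(unreadable) | 184 | 6.5%
(unreadable) | 132 | 4.7%
(unreadable) | 114 | 4.1%
(unreadable) | 107 | 3.8%
(unreadable) | 100 | 3.6%
(unreadable) | 100 | 3.6%
(unreadable) | 94 | 3.3%
(unreadable) | 83 | 3.0%
(unreadable) | 43 | 1.5%
(unreadable) | 40 | 1.4%
Other values (294) | 1814 | 64.5%

Punctuation
Value | Count | Frequency (%)
(unreadable) | 15 | 44.1%
(unreadable) | 15 | 44.1%
(unreadable) | 2 | 5.9%
(unreadable) | 2 | 5.9%

None
Value | Count | Frequency (%)
(unreadable) | 10 | 34.5%
(unreadable) | 10 | 34.5%
· | 7 | 24.1%
(unreadable) | 1 | 3.4%
(unreadable) | 1 | 3.4%

Compat Jamo
Value | Count | Frequency (%)
(unreadable) | 1 | 100.0%

CJK
Value | Count | Frequency (%)
(unreadable) | 1 | 50.0%
(unreadable) | 1 | 50.0%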

작성자
Categorical

HIGH CORRELATION 

Distinct: 5
Distinct (%): 2.0%
Missing: 0
Missing (%): 0.0%
Memory size: 2.1 KiB
<NA>: 124
김성현: 76
신정윤: 39
백세한: 6
관리자: 3

Length

Max length: 4
Median length: 3.5
Mean length: 3.5
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 신정윤
2nd row: 신정윤
3rd row: 신정윤
4th row: 신정윤
5th row: 신정윤

Common Values

Value | Count | Frequency (%)
<NA> | 124 | 50.0%
김성현 | 76 | 30.6%
신정윤 | 39 | 15.7%
백세한 | 6 | 2.4%
관리자 | 3 | 1.2%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
na | 124 | 50.0%
김성현 | 76 | 30.6%
신정윤 | 39 | 15.7%
백세한 | 6 | 2.4%
관리자 | 3 | 1.2%

작성일
Date

MISSING 

Distinct: 104
Distinct (%): 83.9%
Missing: 124
Missing (%): 50.0%
Memory size: 2.1 KiB
Minimum: 2020-01-22 00:00:00
Maximum: 2023-08-16 00:00:00
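
The Distinct (%) figure is the number of distinct dates divided by the number of non-missing values (104 / 124 ≈ 83.9%). A minimal sketch of the same summary, computed on the ten 작성일 values that appear in the sample rows at the end of this report rather than on the full column:

```python
from datetime import date

# 작성일 values from the first ten sample rows of the report.
raw = [
    "2023-08-16", "2023-08-09", "2023-07-28", "2023-07-27", "2023-07-26",
    "2023-07-26", "2023-07-25", "2023-07-21", "2023-07-21", "2023-07-21",
]
dates = [date.fromisoformat(text) for text in raw]

earliest, latest = min(dates), max(dates)
distinct = len(set(dates))
distinct_pct = 100 * distinct / len(dates)
span_days = (latest - earliest).days

print(earliest, latest, distinct, f"{distinct_pct:.1f}%", span_days)
```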
Histogram with fixed size bins (bins=50)

Interactions


Correlations

Variable | 연번 | 게시번호 | 작성자
연번 | 1.000 | 1.000 | 0.847
게시번호 | 1.000 | 1.000 | 0.986
작성자 | 0.847 | 0.986 | 1.000

Variable | 작성자 | 구분
작성자 | 1.000 | 1.000
구분 | 1.000 | 1.000

Variable | 연번 | 게시번호 | 구분 | 작성자
연번 | 1.000 | 1.000 | 1.000 | 0.675
게시번호 | 1.000 | 1.000 | 1.000 | 0.885
구분 | 1.000 | 1.000 | 1.000 | 1.000
작성자 | 0.675 | 0.885 | 1.000 | 1.000
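
The report does not label which coefficient each matrix uses. One common measure of association between two categorical variables is Cramér's V, and the sketch below shows that it equals 1 for 작성자 and 구분 under two assumptions: that the measure applies here, and that every 보도자료 row has a named author while every <NA> row is <NA> in both columns. The pair counts are rebuilt from the Common Values tables, and the uncorrected formula is used.

```python
import math
from collections import Counter

# (구분, 작성자) pairs rebuilt from the reported counts (assumed pairing).
pairs = (
    [("보도자료", "김성현")] * 76
    + [("보도자료", "신정윤")] * 39
    + [("보도자료", "백세한")] * 6
    + [("보도자료", "관리자")] * 3
    + [("<NA>", "<NA>")] * 124
)

def cramers_v(pairs):
    """Uncorrected Cramér's V from a list of (a, b) category pairs."""
    n = len(pairs)
    joint = Counter(pairs)
    row_totals = Counter(a for a, _ in pairs)
    col_totals = Counter(b for _, b in pairs)
    chi2 = 0.0
    for a, row_total in row_totals.items():
        for b, col_total in col_totals.items():
            expected = row_total * col_total / n
            observed = joint[(a, b)]  # Counter returns 0 for unseen pairs
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals)) - 1
    return math.sqrt(chi2 / (n * k))

v = cramers_v(pairs)
print(round(v, 3))
```

A value of 1 here reflects that 구분 is fully determined by 작성자 once the empty rows are counted as their own category, so the high-correlation alerts say more about the empty rows than about the press releases themselves.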

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.
The correlation heatmap measures nullity correlation: how strongly the presence or absence of one variable affects the presence of another.
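
Nullity correlation can be sketched as the Pearson correlation between per-column missing-value indicators. The example below assumes a layout of 124 complete rows followed by 124 rows in which both 연번 and 제목 are missing, and adds a hypothetical third column whose gaps alternate row by row for contrast. The `pearson` helper and the indicator lists are illustrative, not the report's implementation:

```python
import math

def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)

# 1 marks a missing cell, 0 a present one (assumed row layout).
missing_serial = [0] * 124 + [1] * 124   # 연번
missing_title = [0] * 124 + [1] * 124    # 제목
missing_other = [0, 1] * 124             # hypothetical column, alternating gaps

same_rows = pearson(missing_serial, missing_title)
unrelated = pearson(missing_serial, missing_other)
print(same_rows, unrelated)
```

Columns that are missing in exactly the same rows score 1, while the alternating column scores 0 against them.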

Sample

First rows
Index | 연번 | 구분 | 게시번호 | 제목 | 작성자 | 작성일
0 | 124 | 보도자료 | 6651 | 미국 해사청장, 한국해양수산연수원 및 APEC SEN 방문 | 신정윤 | 2023-08-16
1 | 123 | 보도자료 | 6650 | 해사산업 발전을 위한 전문가 인적교류 증진 추진 | 신정윤 | 2023-08-09
2 | 122 | 보도자료 | 6640 | 한국해양수산연수원, 대국민 참여 혁신 아이디어 공모전 개최 | 신정윤 | 2023-07-28
3 | 121 | 보도자료 | 6636 | 한국해양수산연수원, 환경보호 위한 ‘1회용품 제로 챌린지’ 동참 | 신정윤 | 2023-07-27
4 | 120 | 보도자료 | 6631 | 한국해양수산연수원, 2023년 ODA 국제승선실습(GOBT) 프로그램 수료식 개최 | 신정윤 | 2023-07-26
5 | 119 | 보도자료 | 6630 | 한국해양수산연수원, 정전 70주년 ‘6.25 참전유공자 땡큐챌린지’ 동참 | 신정윤 | 2023-07-26
6 | 118 | 보도자료 | 6628 | 푸른 바다의 미래, 글로벌 해운 리더를 꿈꾸다! | 신정윤 | 2023-07-25
7 | 117 | 보도자료 | 6624 | 한국해양수산연수원, 원양어선 해기사 단기 양성과정 전원취업 배출 | 신정윤 | 2023-07-21
8 | 116 | 보도자료 | 6623 | 한국해양수산연수원, 부패방지경영시스템(ISO37001) 국제표준 인증 획득 | 신정윤 | 2023-07-21
9 | 115 | 보도자료 | 6622 | 한국해양수산연수원, ESG경영 실천을 위한 제3기 KIMFT 시민참여혁신단 모집 | 신정윤 | 2023-07-21

Last rows
Index | 연번 | 구분 | 게시번호 | 제목 | 작성자 | 작성일
238 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
239 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
240 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
241 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
242 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
243 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
244 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
245 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
246 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>
247 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA>

Duplicate rows

Most frequently occurring

Index | 연번 | 구분 | 게시번호 | 제목 | 작성자 | 작성일 | # duplicates
0 | <NA> | <NA> | <NA> | <NA> | <NA> | <NA> | 124
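
A table like this can be produced by counting how often each full row occurs and keeping the rows seen more than once. The sketch below uses `collections.Counter` on a few hypothetical rows, with None standing for <NA>. On this reading, the overview's count of 1 duplicate row would refer to the single repeated pattern and the 124 here to its number of occurrences, which is an inference rather than something the report states.

```python
from collections import Counter

# Hypothetical rows as tuples (연번, 구분, 게시번호); None stands for <NA>.
rows = [
    (124, "보도자료", 6651),
    (123, "보도자료", 6650),
    (None, None, None),
    (None, None, None),
    (None, None, None),
]

# Tuples are hashable, so each whole row can serve as a Counter key.
occurrences = Counter(rows)
duplicated = {row: n for row, n in occurrences.items() if n > 1}

print(duplicated)  # {(None, None, None): 3}
```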