Overview

Dataset statistics

Number of variables: 5
Number of observations: 1351
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 176
Duplicate rows (%): 13.0%
Total size in memory: 52.9 KiB
Average record size in memory: 40.1 B
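
The duplicate figures can be checked by hand. The sketch below assumes that "Duplicate rows" counts the distinct row patterns that occur more than once. The report does not state this, but it is consistent with the index column of the "Duplicate rows" table. The helper name `duplicate_summary` and the toy rows are illustrative only.

```python
from collections import Counter

def duplicate_summary(rows):
    """Return (distinct row patterns seen more than once, that count / total rows)."""
    counts = Counter(tuple(r) for r in rows)
    n_groups = sum(1 for c in counts.values() if c > 1)
    return n_groups, n_groups / len(rows)

# Toy data: two identical rows plus one distinct row (not the full dataset).
rows = [
    ("여수", "건별신고", "2021-08-10 13:00", "2021-08-10 18:00", "수중형"),
    ("여수", "건별신고", "2021-08-10 13:00", "2021-08-10 18:00", "수중형"),
    ("여수", "건별신고", "2019-10-30 11:00", "2019-10-30 13:00", "수중형"),
]
n_groups, share = duplicate_summary(rows)

# Arithmetic behind the reported percentage: 176 of 1351 rows.
reported_pct = round(176 / 1351 * 100, 1)
```

With the report's own numbers, `reported_pct` matches the 13.0% shown above.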

Variable types

Categorical: 3
DateTime: 1
Text: 1

Dataset

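Description: Data on coastal experience activity reports (연안체험활동 신고현황) from 2017 through October 5, 2023. It provides fields including the coast guard station (해경서), report type (신고종류), activity start date (활동시작일), activity end date (활동종료일), and activity type (활동형태).
Author: Korea Coast Guard (해양경찰청)
URL: https://www.data.go.kr/data/15092368/fileData.do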

Alerts

Duplicates: Dataset has 176 (13.0%) duplicate rows
High correlation: 해경서 is highly overall correlated with 활동형태
High correlation: 활동형태 is highly overall correlated with 해경서
Imbalance: 활동형태 is highly imbalanced (62.8%)
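
The 62.8% imbalance figure can be reproduced from the 활동형태 counts reported later (1109, 227, 13, 2). The sketch below assumes the score is one minus the normalized Shannon entropy of the class distribution. That definition reproduces the reported number, which suggests but does not prove that the tool uses it. The function name `imbalance_score` is illustrative.

```python
import math

def imbalance_score(counts):
    """1 - H / log(n): 0 for a perfectly balanced column, 1 for a single dominant class."""
    total = sum(counts)
    n_classes = len(counts)
    if n_classes < 2:
        return 0.0
    entropy = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log(n_classes)

# Counts for 활동형태 taken from the Common Values table of this report.
score = imbalance_score([1109, 227, 13, 2])
print(f"{score:.1%}")
```

A perfectly balanced column (for example four classes with equal counts) would score 0 under this definition.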

Reproduction

Analysis started: 2023-12-12 05:53:46.050805
Analysis finished: 2023-12-12 05:53:46.457942
Duration: 0.41 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

해경서
Categorical

HIGH CORRELATION 

Distinct: 15
Distinct (%): 1.1%
Missing: 0
Missing (%): 0.0%
Memory size: 10.7 KiB

동해: 352
통영: 268
여수: 263
포항: 140
속초: 127
Other values (10): 201

Length

Max length: 3
Median length: 2
Mean length: 2.0710585
Min length: 2

Unique

Unique: 2
Unique (%): 0.1%

Sample

1st row: 통영
2nd row: 통영
3rd row: 통영
4th row: 통영
5th row: 동해

Common Values

Value | Count | Frequency (%)
동해 | 352 | 26.1%
통영 | 268 | 19.8%
여수 | 263 | 19.5%
포항 | 140 | 10.4%
속초 | 127 | 9.4%
서귀포 | 96 | 7.1%
부산 | 39 | 2.9%
제주 | 38 | 2.8%
완도 | 9 | 0.7%
울진 | 5 | 0.4%
Other values (5) | 14 | 1.0%

Length

[Figure: Histogram of lengths of the category]

신고종류
Categorical

Distinct: 4
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 10.7 KiB

건별신고: 944
기간제신고: 310
계획신고: 93
계획(변경)신고: 4

Length

Max length: 8
Median length: 4
Mean length: 4.2413027
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 기간제신고
2nd row: 기간제신고
3rd row: 기간제신고
4th row: 기간제신고
5th row: 건별신고

Common Values

Value | Count | Frequency (%)
건별신고 | 944 | 69.9%
기간제신고 | 310 | 22.9%
계획신고 | 93 | 6.9%
계획(변경)신고 | 4 | 0.3%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

활동시작일
DateTime

Distinct: 925
Distinct (%): 68.5%
Missing: 0
Missing (%): 0.0%
Memory size: 10.7 KiB
Minimum: 2017-06-19 00:00:00
Maximum: 2023-10-29 12:00:00

[Figure: Histogram with fixed size bins (bins=50)]

활동종료일
Text

Distinct: 884
Distinct (%): 65.4%
Missing: 0
Missing (%): 0.0%
Memory size: 10.7 KiB

Length

Max length: 19
Median length: 16
Mean length: 16.008882
Min length: 16

Characters and Unicode

Total characters: 21628
Distinct characters: 15
Distinct categories: 5
Distinct scripts: 2
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
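
The length and character statistics hint at why this column was typed as Text rather than DateTime. Most values are 16 characters long, but the maximum is 19, there are 1355 spaces for 1351 rows, and the letters P and M each occur 4 times. That is consistent with 4 values carrying an extra "PM" token (1351 × 16 + 4 × 3 = 21628 characters). The exact layout of those values is not shown in the report, so the second format below is a guess, and the helper name `parse_end_time` is illustrative.

```python
from datetime import datetime

# Formats to try, in order. The first matches the sampled rows.
# The second is an ASSUMED layout for the 4 values containing "PM";
# the report never shows those values. %p is locale-dependent and
# matches "AM"/"PM" in the default C/English locale.
CANDIDATE_FORMATS = ("%Y-%m-%d %H:%M", "%Y-%m-%d %p %I:%M")

def parse_end_time(text):
    """Return a datetime for the first matching format, or None if none match."""
    for fmt in CANDIDATE_FORMATS:
        try:
            return datetime.strptime(text.strip(), fmt)
        except ValueError:
            continue
    return None

regular = parse_end_time("2017-09-18 00:00")        # value from the Sample block
guessed = parse_end_time("2020-08-30 PM 01:30")     # hypothetical 19-character value
unparsed = parse_end_time("not a timestamp")
```

Returning `None` instead of raising keeps the unparseable values visible, so they can be listed and fixed at the source rather than silently dropped.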

Unique

Unique: 649
Unique (%): 48.0%

Sample

1st row: 2017-09-18 00:00
2nd row: 2017-09-20 00:00
3rd row: 2017-09-30 00:00
4th row: 2017-09-30 00:00
5th row: 2017-07-01 18:00

Value | Count | Frequency (%)
00:00 | 349 | 12.9%
18:00 | 124 | 4.6%
17:00 | 92 | 3.4%
16:00 | 77 | 2.8%
14:00 | 48 | 1.8%
17:30 | 48 | 1.8%
13:00 | 44 | 1.6%
15:00 | 40 | 1.5%
12:00 | 36 | 1.3%
11:00 | 27 | 1.0%
Other values (641) | 1821 | 67.3%

Most occurring characters

Value | Count | Frequency (%)
0 | 6245 | 28.9%
1 | 3010 | 13.9%
2 | 2728 | 12.6%
- | 2702 | 12.5%
(space) | 1355 | 6.3%
: | 1351 | 6.2%
7 | 1221 | 5.6%
8 | 933 | 4.3%
3 | 725 | 3.4%
9 | 411 | 1.9%
Other values (5) | 947 | 4.4%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 16212 | 75.0%
Dash Punctuation | 2702 | 12.5%
Space Separator | 1355 | 6.3%
Other Punctuation | 1351 | 6.2%
Uppercase Letter | 8 | < 0.1%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 6245 | 38.5%
1 | 3010 | 18.6%
2 | 2728 | 16.8%
7 | 1221 | 7.5%
8 | 933 | 5.8%
3 | 725 | 4.5%
9 | 411 | 2.5%
6 | 357 | 2.2%
5 | 325 | 2.0%
4 | 257 | 1.6%

Uppercase Letter
Value | Count | Frequency (%)
P | 4 | 50.0%
M | 4 | 50.0%

Dash Punctuation
Value | Count | Frequency (%)
- | 2702 | 100.0%

Space Separator
Value | Count | Frequency (%)
(space) | 1355 | 100.0%

Other Punctuation
Value | Count | Frequency (%)
: | 1351 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 21620 | > 99.9%
Latin | 8 | < 0.1%

Most frequent character per script

Common
Value | Count | Frequency (%)
0 | 6245 | 28.9%
1 | 3010 | 13.9%
2 | 2728 | 12.6%
- | 2702 | 12.5%
(space) | 1355 | 6.3%
: | 1351 | 6.2%
7 | 1221 | 5.6%
8 | 933 | 4.3%
3 | 725 | 3.4%
9 | 411 | 1.9%
Other values (3) | 939 | 4.3%

Latin
Value | Count | Frequency (%)
P | 4 | 50.0%
M | 4 | 50.0%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 21628 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
0 | 6245 | 28.9%
1 | 3010 | 13.9%
2 | 2728 | 12.6%
- | 2702 | 12.5%
(space) | 1355 | 6.3%
: | 1351 | 6.2%
7 | 1221 | 5.6%
8 | 933 | 4.3%
3 | 725 | 3.4%
9 | 411 | 1.9%
Other values (5) | 947 | 4.4%

활동형태
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 4
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 10.7 KiB

수중형: 1109
수상형: 227
일반형: 13
강원특별자치도: 2

Length

Max length: 7
Median length: 3
Mean length: 3.0059215
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 수중형
2nd row: 수중형
3rd row: 수중형
4th row: 수중형
5th row: 수중형

Common Values

Value | Count | Frequency (%)
수중형 | 1109 | 82.1%
수상형 | 227 | 16.8%
일반형 | 13 | 1.0%
강원특별자치도 | 2 | 0.1%
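
One value in this table stands out: 강원특별자치도 is the name of a province (Gangwon Special Self-Governing Province), not an activity type, so those 2 rows may hold a value that belongs in another field. A minimal sketch of a check that would surface such rows follows. The expected set is an assumption based on the three remaining values, not an official code list, and the helper name `unexpected_values` is illustrative.

```python
from collections import Counter

# ASSUMPTION: the valid activity types are the three "-형" values seen in the report.
EXPECTED_ACTIVITY_TYPES = {"수중형", "수상형", "일반형"}

def unexpected_values(values, expected):
    """Count every value that is not in the expected set."""
    return Counter(v for v in values if v not in expected)

# Column rebuilt from the counts in the Common Values table above.
values = ["수중형"] * 1109 + ["수상형"] * 227 + ["일반형"] * 13 + ["강원특별자치도"] * 2
flagged = unexpected_values(values, EXPECTED_ACTIVITY_TYPES)
```

Here `flagged` contains only 강원특별자치도 with a count of 2, which matches the table.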

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the common values]

Correlations

Variable | 해경서 | 신고종류 | 활동형태
해경서 | 1.000 | 0.656 | 0.822
신고종류 | 0.656 | 1.000 | 0.490
활동형태 | 0.822 | 0.490 | 1.000

Variable | 신고종류 | 활동형태 | 해경서
신고종류 | 1.000 | 0.208 | 0.435
활동형태 | 0.208 | 1.000 | 0.629
해경서 | 0.435 | 0.629 | 1.000

Variable | 해경서 | 신고종류 | 활동형태
해경서 | 1.000 | 0.435 | 0.629
신고종류 | 0.435 | 1.000 | 0.208
활동형태 | 0.629 | 0.208 | 1.000
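
The extracted text does not say which association measure each matrix uses. Cramér's V is one of the measures ydata-profiling offers for pairs of categorical variables, so a plain version is sketched below to show how such a coefficient is built from a contingency table. This is an illustration only: it applies no bias correction and is not expected to reproduce the values above. The function name `cramers_v` and the toy inputs are assumptions.

```python
from collections import Counter
import math

def cramers_v(xs, ys):
    """Uncorrected Cramér's V between two equally long sequences of category labels."""
    n = len(xs)
    joint = Counter(zip(xs, ys))   # observed cell counts
    row = Counter(xs)              # marginal counts of the first variable
    col = Counter(ys)              # marginal counts of the second variable
    chi2 = 0.0
    for r, r_count in row.items():
        for c, c_count in col.items():
            expected = r_count * c_count / n
            observed = joint.get((r, c), 0)
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(row), len(col)) - 1
    if k == 0:
        return 0.0                 # a constant column carries no association
    return math.sqrt(chi2 / (n * k))

# Toy inputs: one perfectly associated pair and one independent pair.
v_perfect = cramers_v(["a", "a", "b", "b"], ["x", "x", "y", "y"])
v_independent = cramers_v(["a", "a", "b", "b"], ["x", "y", "x", "y"])
```

The coefficient ranges from 0 (no association) to 1 (one variable determines the other), which is the scale on which the 0.822 between 해경서 and 활동형태 should be read if that matrix is indeed a Cramér's V.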

Missing values

[Figure: A simple visualization of nullity by column.]
[Figure: Nullity matrix, a data-dense display which lets you quickly visually pick out patterns in data completion.]

Sample

Index | 해경서 | 신고종류 | 활동시작일 | 활동종료일 | 활동형태
0 | 통영 | 기간제신고 | 2017-06-19 00:00 | 2017-09-18 00:00 | 수중형
1 | 통영 | 기간제신고 | 2017-06-21 00:00 | 2017-09-20 00:00 | 수중형
2 | 통영 | 기간제신고 | 2017-06-26 00:00 | 2017-09-30 00:00 | 수중형
3 | 통영 | 기간제신고 | 2017-06-27 00:00 | 2017-09-30 00:00 | 수중형
4 | 동해 | 건별신고 | 2017-07-01 07:00 | 2017-07-01 18:00 | 수중형
5 | 동해 | 건별신고 | 2017-07-01 08:40 | 2017-07-01 11:30 | 수중형
6 | 동해 | 건별신고 | 2017-07-01 09:00 | 2017-07-01 18:00 | 수중형
7 | 동해 | 건별신고 | 2017-07-01 09:00 | 2017-07-01 11:50 | 수중형
8 | 동해 | 건별신고 | 2017-07-01 13:20 | 2017-07-01 16:30 | 수중형
9 | 동해 | 건별신고 | 2017-07-02 08:20 | 2017-07-02 11:20 | 수중형

Index | 해경서 | 신고종류 | 활동시작일 | 활동종료일 | 활동형태
1341 | 부산 | 계획신고 | 2023-10-07 12:00 | 2023-10-07 16:00 | 수중형
1342 | 부산 | 계획신고 | 2023-10-07 12:00 | 2023-10-07 16:00 | 수중형
1343 | 부산 | 계획신고 | 2023-10-07 12:00 | 2023-10-07 16:00 | 수중형
1344 | 서귀포 | 기간제신고 | 2023-10-08 00:00 | 2023-10-08 00:00 | 수상형
1345 | 부산 | 계획신고 | 2023-10-14 12:00 | 2023-10-14 16:00 | 수중형
1346 | 부산 | 계획신고 | 2023-10-15 12:00 | 2023-10-15 16:00 | 수중형
1347 | 부산 | 계획신고 | 2023-10-21 12:00 | 2023-10-21 16:00 | 수중형
1348 | 부산 | 계획신고 | 2023-10-22 12:00 | 2023-10-22 16:00 | 수중형
1349 | 부산 | 계획신고 | 2023-10-28 12:00 | 2023-10-28 16:00 | 수중형
1350 | 부산 | 계획신고 | 2023-10-29 12:00 | 2023-10-29 16:00 | 수중형

Duplicate rows

Most frequently occurring

Index | 해경서 | 신고종류 | 활동시작일 | 활동종료일 | 활동형태 | # duplicates
110 | 여수 | 기간제신고 | 2020-06-21 00:00 | 2020-09-20 00:00 | 수중형 | 11
98 | 여수 | 건별신고 | 2021-08-10 13:00 | 2021-08-10 18:00 | 수중형 | 8
166 | 통영 | 기간제신고 | 2020-07-09 00:00 | 2020-09-30 00:00 | 수상형 | 7
11 | 동해 | 건별신고 | 2017-08-06 21:30 | 2017-08-06 22:30 | 수중형 | 6
28 | 동해 | 기간제신고 | 2017-07-14 00:00 | 2017-10-13 00:00 | 수중형 | 6
84 | 여수 | 건별신고 | 2020-08-30 22:27 | 2020-08-30 13:30 | 수중형 | 6
100 | 여수 | 건별신고 | 2021-08-28 10:00 | 2021-08-28 13:00 | 수중형 | 6
164 | 통영 | 기간제신고 | 2017-07-27 00:00 | 2017-10-26 00:00 | 수중형 | 6
26 | 동해 | 건별신고 | 2017-09-10 09:00 | 2017-09-10 14:30 | 수중형 | 5
71 | 여수 | 건별신고 | 2019-10-30 11:00 | 2019-10-30 13:00 | 수중형 | 5
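
One row in this table (index 84) has an end time of 13:30 that precedes its start time of 22:27 on the same day, which looks like a data-entry problem. A minimal sketch of a check for such rows follows. It assumes both columns use the 16-character layout seen in the samples, and the helper name `ends_before_start` is illustrative.

```python
from datetime import datetime

# Layout of the sampled values; rows in another layout would need separate handling.
FMT = "%Y-%m-%d %H:%M"

def ends_before_start(start_text, end_text):
    """True when the reported end precedes the reported start."""
    return datetime.strptime(end_text, FMT) < datetime.strptime(start_text, FMT)

# (start, end) pairs copied from the duplicate-rows table.
rows = [
    ("2020-08-30 22:27", "2020-08-30 13:30"),  # index 84
    ("2021-08-10 13:00", "2021-08-10 18:00"),  # index 98
]
suspicious = [r for r in rows if ends_before_start(*r)]
```

Only the index 84 pair ends up in `suspicious`. Whether the start or the end value is wrong cannot be decided from the report alone.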