Overview

Dataset statistics

Number of variables: 7
Number of observations: 39
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 2.3 KiB
Average record size in memory: 61.4 B

Variable types

Text: 2
Categorical: 3
Numeric: 2

Dataset

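Description: Provides fishing vessel registration information for Sasang-gu, including vessel name (어선명), vessel number (어선번호), fishing method (어업방법), propulsion engine (추진기관), horsepower (마력), gross tonnage (톤수), and hull material (선체재질).
Author: Sasang-gu, Busan Metropolitan City (부산광역시 사상구)
URL: https://www.data.go.kr/data/15048112/fileData.do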

Alerts

추진기관 is highly overall correlated with 마력 and 1 other field [High correlation]
선체재질 is highly overall correlated with 마력 and 1 other field [High correlation]
톤수 is highly overall correlated with 마력 [High correlation]
마력 is highly overall correlated with 톤수 and 2 other fields [High correlation]
선체재질 is highly imbalanced (82.8%) [Imbalance]
추진기관 is highly imbalanced (53.1%) [Imbalance]
어선명 has unique values [Unique]
어선번호 has unique values [Unique]
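
The two imbalance percentages are consistent with an entropy-based score: one minus the Shannon entropy of the class frequencies, divided by the maximum entropy for that number of classes. The sketch below assumes that definition. The function name `imbalance_score` is illustrative and is not the profiling tool's API. The class counts are the ones listed for 선체재질 and 추진기관 in the Variables section.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / H_max for a list of class counts.

    0.0 means perfectly balanced classes; values near 1.0 mean one class dominates.
    """
    total = sum(counts)
    n_classes = len(counts)
    if n_classes < 2:
        return 0.0
    # Shannon entropy in bits over the observed class frequencies
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(n_classes)

# Class counts as reported in this document
hull_material_counts = [38, 1]        # 선체재질
propulsion_counts = [32, 3, 3, 1]     # 추진기관

print(f"선체재질: {imbalance_score(hull_material_counts):.1%}")
print(f"추진기관: {imbalance_score(propulsion_counts):.1%}")
```

Under this assumption, the dominance of one class (38 of 39 rows for 선체재질) is what drives the score toward 100%.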

Reproduction

Analysis started: 2024-04-21 01:13:46.174910
Analysis finished: 2024-04-21 01:13:48.395612
Duration: 2.22 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
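
As a quick consistency check, the reported duration can be recomputed from the two timestamps. This is only a sketch using the standard library; the variable names are illustrative.

```python
from datetime import datetime

# Timestamps copied from the Reproduction section
started = datetime.fromisoformat("2024-04-21 01:13:46.174910")
finished = datetime.fromisoformat("2024-04-21 01:13:48.395612")

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```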

Variables

어선명
Text

UNIQUE 

Distinct: 39
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 444.0 B

Length

Max length: 5
Median length: 3
Mean length: 2.974359
Min length: 1

Characters and Unicode

Total characters: 116
Distinct characters: 57
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
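
To illustrate how such character properties can be looked up, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. The helper name `category_counts` is an assumption for illustration, and the input is a handful of 어선명 values that appear in this report. The category codes correspond to the labels used in the tables: `Lo` is "Other Letter", `Nd` is "Decimal Number", and `Pd` is "Dash Punctuation". Script and block lookups are not part of `unicodedata` and are not shown.

```python
import unicodedata
from collections import Counter

def category_counts(values):
    """Count Unicode general categories over every character in the given strings."""
    return Counter(unicodedata.category(ch) for value in values for ch in value)

# A few vessel names that appear in this report
names = ["불광호", "재관호", "영진호", "세일호", "오동", "엄광1호"]

for code, count in category_counts(names).most_common():
    print(code, count)
```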

Unique

Unique: 39
Unique (%): 100.0%

Sample

1st row: 불광호
2nd row: 재관호
3rd row: 영진호
4th row: 세일호
5th row: 오동

Value              Count  Frequency (%)
불광호             1      2.6%
용삼호             1      2.6%
엄광1호            1      2.6%
진우호             1      2.6%
장인도호           1      2.6%
장석호             1      2.6%
엄광호             1      2.6%
태정호             1      2.6%
바다호             1      2.6%
영조호             1      2.6%
Other values (29)  29     74.4%

Most occurring characters

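Value              Count  Frequency (%)
[not recovered]    29     25.0%
[not recovered]    4      3.4%
[not recovered]    4      3.4%
[not recovered]    4      3.4%
[not recovered]    3      2.6%
[not recovered]    3      2.6%
[not recovered]    3      2.6%
2                  3      2.6%
[not recovered]    3      2.6%
1                  3      2.6%
Other values (47)  57     49.1%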

Most occurring categories

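Value           Count  Frequency (%)
Other Letter    108    93.1%
Decimal Number  8      6.9%

Most frequent character per category

Other Letter
Value              Count  Frequency (%)
[not recovered]    29     26.9%
[not recovered]    4      3.7%
[not recovered]    4      3.7%
[not recovered]    4      3.7%
[not recovered]    3      2.8%
[not recovered]    3      2.8%
[not recovered]    3      2.8%
[not recovered]    3      2.8%
[not recovered]    3      2.8%
[not recovered]    2      1.9%
Other values (43)  50     46.3%

Decimal Number
Value  Count  Frequency (%)
2      3      37.5%
1      3      37.5%
8      1      12.5%
9      1      12.5%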

Most occurring scripts

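Value   Count  Frequency (%)
Hangul  108    93.1%
Common  8      6.9%

Most frequent character per script

Hangul
Value              Count  Frequency (%)
[not recovered]    29     26.9%
[not recovered]    4      3.7%
[not recovered]    4      3.7%
[not recovered]    4      3.7%
[not recovered]    3      2.8%
[not recovered]    3      2.8%
[not recovered]    3      2.8%
[not recovered]    3      2.8%
[not recovered]    3      2.8%
[not recovered]    2      1.9%
Other values (43)  50     46.3%

Common
Value  Count  Frequency (%)
2      3      37.5%
1      3      37.5%
8      1      12.5%
9      1      12.5%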

Most occurring blocks

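Value   Count  Frequency (%)
Hangul  108    93.1%
ASCII   8      6.9%

Most frequent character per block

Hangul
Value              Count  Frequency (%)
[not recovered]    29     26.9%
[not recovered]    4      3.7%
[not recovered]    4      3.7%
[not recovered]    4      3.7%
[not recovered]    3      2.8%
[not recovered]    3      2.8%
[not recovered]    3      2.8%
[not recovered]    3      2.8%
[not recovered]    3      2.8%
[not recovered]    2      1.9%
Other values (43)  50     46.3%

ASCII
Value  Count  Frequency (%)
2      3      37.5%
1      3      37.5%
8      1      12.5%
9      1      12.5%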

어선번호
Text

UNIQUE 

Distinct: 39
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 444.0 B

Length

Max length: 15
Median length: 15
Mean length: 15
Min length: 15

Characters and Unicode

Total characters: 585
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 39
Unique (%): 100.0%

Sample

1st row: 0004004-6267103
2nd row: 0005001-6265302
3rd row: 0005014-6264406
4th row: 0010019-6264400
5th row: 0102001-6265306

Value              Count  Frequency (%)
0004004-6267103    1      2.6%
1104001-6265301    1      2.6%
1508001-6265305    1      2.6%
1512001-6265306    1      2.6%
1604001-6265301    1      2.6%
1802001-6265301    1      2.6%
2206001-6265304    1      2.6%
9502027-6265309    1      2.6%
9503003-6477701    1      2.6%
1107001-6265305    1      2.6%
Other values (29)  29     74.4%
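
Every 어선번호 value is 15 characters long, and the values shown above all consist of seven digits, a hyphen, and seven digits. The sketch below validates that shape. The pattern is an assumption inferred from the values visible in this report, not an official specification of the vessel number format, and the function name is illustrative.

```python
import re

# Assumed shape inferred from the report: 7 ASCII digits, a hyphen, 7 ASCII digits
VESSEL_NUMBER = re.compile(r"[0-9]{7}-[0-9]{7}")

def is_valid_vessel_number(value: str) -> bool:
    """Return True if the whole string matches the assumed vessel number shape."""
    return VESSEL_NUMBER.fullmatch(value) is not None

# Sample rows copied from this report
samples = [
    "0004004-6267103",
    "0005001-6265302",
    "0005014-6264406",
    "0010019-6264400",
    "0102001-6265306",
]

print(all(is_valid_vessel_number(s) for s in samples))
```

A check like this could flag malformed identifiers before profiling, since a text column with all-unique values gives no other signal about format errors.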

Most occurring characters

Value  Count  Frequency (%)
0      174    29.7%
6      89     15.2%
1      63     10.8%
2      60     10.3%
5      42     7.2%
3      41     7.0%
-      39     6.7%
4      32     5.5%
9      19     3.2%
7      16     2.7%

Most occurring categories

Value             Count  Frequency (%)
Decimal Number    546    93.3%
Dash Punctuation  39     6.7%

Most frequent character per category

Decimal Number
Value  Count  Frequency (%)
0      174    31.9%
6      89     16.3%
1      63     11.5%
2      60     11.0%
5      42     7.7%
3      41     7.5%
4      32     5.9%
9      19     3.5%
7      16     2.9%
8      10     1.8%

Dash Punctuation
Value  Count  Frequency (%)
-      39     100.0%

Most occurring scripts

Value   Count  Frequency (%)
Common  585    100.0%

Most frequent character per script

Common
Value  Count  Frequency (%)
0      174    29.7%
6      89     15.2%
1      63     10.8%
2      60     10.3%
5      42     7.2%
3      41     7.0%
-      39     6.7%
4      32     5.5%
9      19     3.2%
7      16     2.7%

Most occurring blocks

Value  Count  Frequency (%)
ASCII  585    100.0%

Most frequent character per block

ASCII
Value  Count  Frequency (%)
0      174    29.7%
6      89     15.2%
1      63     10.8%
2      60     10.3%
5      42     7.2%
3      41     7.0%
-      39     6.7%
4      32     5.5%
9      19     3.2%
7      16     2.7%

어업방법
Categorical

Distinct: 7
Distinct (%): 17.9%
Missing: 0
Missing (%): 0.0%
Memory size: 444.0 B

Length

Max length: 6
Median length: 6
Mean length: 5.2820513
Min length: 2

Unique

Unique: 1
Unique (%): 2.6%

Sample

1st row: 연승어업
2nd row: 패류채취어업
3rd row: 패류채취어업
4th row: 연안복합어업
5th row: 자망어업

Common Values

Value         Count  Frequency (%)
패류채취어업  11     28.2%
연승어업      10     25.6%
연안자망어업  8      20.5%
연안복합어업  5      12.8%
자망어업      2      5.1%
연안통발어업  2      5.1%
기타          1      2.6%
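
The frequency column is each count divided by the 39 observations, rounded to one decimal place. A minimal sketch that recomputes it from the counts in the table above; the dictionary and function names are illustrative.

```python
# Counts copied from the Common Values table for 어업방법
fishing_method_counts = {
    "패류채취어업": 11,
    "연승어업": 10,
    "연안자망어업": 8,
    "연안복합어업": 5,
    "자망어업": 2,
    "연안통발어업": 2,
    "기타": 1,
}

def frequency_percent(counts):
    """Map each value to its share of all observations, in percent with one decimal."""
    total = sum(counts.values())
    return {value: round(100 * count / total, 1) for value, count in counts.items()}

for value, pct in frequency_percent(fishing_method_counts).items():
    print(f"{value}: {pct}%")
```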

Length

Histogram of lengths of the category

Common Values (Plot)

Value         Count  Frequency (%)
패류채취어업  11     28.2%
연승어업      10     25.6%
연안자망어업  8      20.5%
연안복합어업  5      12.8%
자망어업      2      5.1%
연안통발어업  2      5.1%
기타          1      2.6%

톤수
Real number (ℝ)

HIGH CORRELATION 

Distinct: 30
Distinct (%): 76.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.444359
Minimum: 0.57
Maximum: 3.49
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 483.0 B

Quantile statistics

Minimum: 0.57
5th percentile: 0.626
Q1: 0.98
Median: 1.17
Q3: 1.855
95th percentile: 3.018
Maximum: 3.49
Range: 2.92
Interquartile range (IQR): 0.875

Descriptive statistics

Standard deviation: 0.79617439
Coefficient of variation (CV): 0.55123027
Kurtosis: 0.48404585
Mean: 1.444359
Median Absolute Deviation (MAD): 0.31
Skewness: 1.2833574
Sum: 56.33
Variance: 0.63389366
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=30)

Common values
Value              Count  Frequency (%)
1.18               4      10.3%
0.98               3      7.7%
0.85               2      5.1%
1.17               2      5.1%
1.07               2      5.1%
1.2                2      5.1%
2.86               1      2.6%
2.81               1      2.6%
0.76               1      2.6%
3.27               1      2.6%
Other values (20)  20     51.3%

Minimum 10 values
Value  Count  Frequency (%)
0.57   1      2.6%
0.59   1      2.6%
0.63   1      2.6%
0.76   1      2.6%
0.82   1      2.6%
0.85   2      5.1%
0.86   1      2.6%
0.89   1      2.6%
0.98   3      7.7%
1.0    1      2.6%

Maximum 10 values
Value  Count  Frequency (%)
3.49   1      2.6%
3.27   1      2.6%
2.99   1      2.6%
2.86   1      2.6%
2.81   1      2.6%
2.71   1      2.6%
2.53   1      2.6%
1.97   1      2.6%
1.93   1      2.6%
1.86   1      2.6%
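
Several of the 톤수 statistics reported above follow directly from others: the range is maximum minus minimum, the IQR is Q3 minus Q1, the coefficient of variation is the standard deviation divided by the mean, and the variance is the squared standard deviation. The sketch below recomputes them from the reported inputs as a consistency check. The function name `derived_stats` is illustrative.

```python
def derived_stats(minimum, maximum, q1, q3, mean, std):
    """Recompute statistics that are simple functions of other summary values."""
    return {
        "range": maximum - minimum,
        "iqr": q3 - q1,
        "cv": std / mean,
        "variance": std ** 2,
    }

# Inputs copied from the 톤수 quantile and descriptive statistics
tonnage = derived_stats(
    minimum=0.57, maximum=3.49,
    q1=0.98, q3=1.855,
    mean=1.444359, std=0.79617439,
)

for name, value in tonnage.items():
    print(f"{name}: {value:.6f}")
```

The same check applies to 마력, using its own minimum, maximum, quartiles, mean, and standard deviation.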

선체재질
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 2
Distinct (%): 5.1%
Missing: 0
Missing (%): 0.0%
Memory size: 444.0 B

Length

Max length: 3
Median length: 3
Mean length: 2.9487179
Min length: 1

Unique

Unique: 1
Unique (%): 2.6%

Sample

1st row: FRP
2nd row: FRP
3rd row: FRP
4th row: FRP
5th row: FRP

Common Values

Value            Count  Frequency (%)
FRP              38     97.4%
[not recovered]  1      2.6%

Length

Histogram of lengths of the category

Common Values (Plot)

Value            Count  Frequency (%)
frp              38     97.4%
[not recovered]  1      2.6%

추진기관
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct: 4
Distinct (%): 10.3%
Missing: 0
Missing (%): 0.0%
Memory size: 444.0 B

Length

Max length: 8
Median length: 6
Mean length: 5.8974359
Min length: 5

Unique

Unique: 1
Unique (%): 2.6%

Sample

1st row: 선박용디젤
2nd row: 가솔린선외기
3rd row: 가솔린선외기
4th row: 가솔린선외기
5th row: 가솔린선외기

Common Values

Value             Count  Frequency (%)
가솔린선외기      32     82.1%
선박용디젤        3      7.7%
가솔린기타        3      7.7%
육상경운기용디젤  1      2.6%

Length

Histogram of lengths of the category

Common Values (Plot)

Value             Count  Frequency (%)
가솔린선외기      32     82.1%
선박용디젤        3      7.7%
가솔린기타        3      7.7%
육상경운기용디젤  1      2.6%

마력
Real number (ℝ)

HIGH CORRELATION 

Distinct: 14
Distinct (%): 35.9%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 133.66667
Minimum: 10
Maximum: 300
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 483.0 B

Quantile statistics

Minimum: 10
5th percentile: 49
Q1: 60
Median: 115
Q3: 164
95th percentile: 300
Maximum: 300
Range: 290
Interquartile range (IQR): 104

Descriptive statistics

Standard deviation: 74.73544
Coefficient of variation (CV): 0.559118
Kurtosis: 0.043153382
Mean: 133.66667
Median Absolute Deviation (MAD): 55
Skewness: 0.74289435
Sum: 5213
Variance: 5585.386
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=14)

Common values
Value             Count  Frequency (%)
60                8      20.5%
115               7      17.9%
150               6      15.4%
200               3      7.7%
300               3      7.7%
90                2      5.1%
160               2      5.1%
250               2      5.1%
168               1      2.6%
175               1      2.6%
Other values (4)  4      10.3%

Minimum 10 values
Value  Count  Frequency (%)
10     1      2.6%
40     1      2.6%
50     1      2.6%
60     8      20.5%
85     1      2.6%
90     2      5.1%
115    7      17.9%
150    6      15.4%
160    2      5.1%
168    1      2.6%

Maximum 10 values
Value  Count  Frequency (%)
300    3      7.7%
250    2      5.1%
200    3      7.7%
175    1      2.6%
168    1      2.6%
160    2      5.1%
150    6      15.4%
115    7      17.9%
90     2      5.1%
85     1      2.6%

Interactions

[Four interaction plots; images not reproduced here.]

Correlations

          어선명  어선번호  어업방법  톤수   선체재질  추진기관  마력
어선명    1.000   1.000     1.000     1.000  1.000     1.000     1.000
어선번호  1.000   1.000     1.000     1.000  1.000     1.000     1.000
어업방법  1.000   1.000     1.000     0.668  0.000     0.000     0.386
톤수      1.000   1.000     0.668     1.000  0.000     0.653     0.856
선체재질  1.000   1.000     0.000     0.000  1.000     1.000     1.000
추진기관  1.000   1.000     0.000     0.653  1.000     1.000     0.889
마력      1.000   1.000     0.386     0.856  1.000     0.889     1.000

          어업방법  추진기관  선체재질
어업방법  1.000     0.000     0.000
추진기관  0.000     1.000     0.973
선체재질  0.000     0.973     1.000

          톤수   마력   어업방법  선체재질  추진기관
톤수      1.000  0.815  0.415     0.000     0.442
마력      0.815  1.000  0.193     0.900     0.743
어업방법  0.415  0.193  1.000     0.000     0.000
선체재질  0.000  0.900  0.000     1.000     0.973
추진기관  0.442  0.743  0.000     0.973     1.000

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly pick out patterns in data completion.

Sample

First rows

    어선명  어선번호         어업방법      톤수  선체재질  추진기관      마력
0   불광호  0004004-6267103  연승어업      3.49  FRP       선박용디젤    168
1   재관호  0005001-6265302  패류채취어업  1.86  FRP       가솔린선외기  175
2   영진호  0005014-6264406  패류채취어업  1.07  FRP       가솔린선외기  115
3   세일호  0010019-6264400  연안복합어업  1.18  FRP       가솔린선외기  115
4   오동    0102001-6265306  자망어업      1.0   FRP       가솔린선외기  115
5   동성    0102002-6265305  패류채취어업  1.2   FRP       가솔린선외기  200
6   연이호  0102005-6317103  자망어업      0.89  FRP       가솔린선외기  115
7   태종호  0106002-6262006  연안통발어업  0.86  FRP       가솔린선외기  60
8   회창    0109002-6265301  연안자망어업  1.51  FRP       가솔린선외기  150
9   원만호  0201004-6263207  연승어업      0.85  FRP       가솔린선외기  60

Last rows

    어선명           어선번호         어업방법      톤수  선체재질  추진기관      마력
29  야마호           9504054-6214401  패류채취어업  0.85  FRP       가솔린선외기  115
30  도연호           9506232-6214401  연승어업      0.63  FRP       가솔린선외기  50
31  [not recovered]  9706001-6265307  연승어업      0.98  FRP       가솔린선외기  60
32  혁기             9706002-6265306  연승어업      0.98  FRP       가솔린선외기  60
33  만선             9706003-6265305  패류채취어업  0.98  FRP       가솔린선외기  115
34  두근호           9708001-6263805  연승어업      0.59  FRP       가솔린선외기  40
35  삼화1            9709001-6265301  연승어업      0.57  FRP       가솔린선외기  60
36  조양호           9907001-6265301  연승어업      1.05  FRP       가솔린선외기  85
37  영철호           9908001-6263203  연안자망어업  1.07  FRP       가솔린선외기  115
38  선기호           9910001-6265304  패류채취어업  1.03  FRP       가솔린선외기  60