Overview

Dataset statistics

Number of variables: 8
Number of observations: 326
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 21.5 KiB
Average record size in memory: 67.4 B

Variable types

Numeric: 3
DateTime: 2
Categorical: 2
Text: 1

Dataset

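Description: Watercraft registration data for Namhae-gun, covering registration date (등록일자), ownership type (소유구분), craft type (기구종류), craft name (기구명), gross tonnage (총톤수), passenger capacity (승선정원), and related fields.
Author: Namhae-gun, Gyeongsangnam-do (경상남도 남해군)
URL: https://bigdata.gyeongnam.go.kr/index.gn?menuCd=DOM_000000114002001000&publicdatapk=15109580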

Alerts

데이터기준일자 has constant value "" (Constant)
총톤수 is highly overall correlated with 승선정원 (High correlation)
승선정원 is highly overall correlated with 총톤수 (High correlation)
소유구분 is highly imbalanced (79.1%) (Imbalance)
자료일련번호 has unique values (Unique)
총톤수 has 47 (14.4%) zeros (Zeros)
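
The 79.1% imbalance figure for 소유구분 is consistent with one minus the normalized Shannon entropy of the class counts listed under that variable (310, 8, 8). The sketch below reproduces that arithmetic, along with the zero share for 총톤수, using only the standard library. The function name `imbalance_score` is illustrative, and the formula is an assumption about how the score is derived rather than something this report states.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log(k), where H is the Shannon entropy of the class shares."""
    total = sum(counts)
    k = len(counts)
    if k < 2 or total == 0:
        return 0.0
    entropy = -sum((c / total) * math.log(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log(k)

# Class counts for 소유구분 as listed in this report: 개인=310, 기타=8, 법인=8
score = imbalance_score([310, 8, 8])

# Zeros in 총톤수 divided by the number of observations
zero_share = 47 / 326

print(f"imbalance: {score:.1%}, zeros: {zero_share:.1%}")
```

A score of 0 would mean perfectly balanced classes; a score near 1 means almost every row falls in a single class.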

Reproduction

Analysis started: 2023-12-11 00:18:38.201549
Analysis finished: 2023-12-11 00:18:39.640327
Duration: 1.44 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

자료일련번호
Real number (ℝ)

UNIQUE 

Distinct: 326
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 163.5
Minimum: 1
Maximum: 326
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 3.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 17.25
Q1: 82.25
Median: 163.5
Q3: 244.75
95-th percentile: 309.75
Maximum: 326
Range: 325
Interquartile range (IQR): 162.5
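
These quantiles are what linear interpolation between order statistics yields on the sequence 1 to 326. The sketch below assumes the column holds exactly those integers, which fits the reported minimum, maximum, distinct count, and sum but is not stated outright. The helper `quantile_linear` is a hypothetical name, not part of any profiling library.

```python
def quantile_linear(sorted_values, q):
    """Quantile by linear interpolation between the two nearest order statistics."""
    pos = q * (len(sorted_values) - 1)
    lower = int(pos)
    upper = min(lower + 1, len(sorted_values) - 1)
    frac = pos - lower
    return sorted_values[lower] + frac * (sorted_values[upper] - sorted_values[lower])

# Assumed column contents: the integers 1..326 in order
values = list(range(1, 327))

q05, q1, med, q3, q95 = (
    quantile_linear(values, q) for q in (0.05, 0.25, 0.5, 0.75, 0.95)
)
iqr = q3 - q1
print(q05, q1, med, q3, q95, iqr)
```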

Descriptive statistics

Standard deviation: 94.252321
Coefficient of variation (CV): 0.57646679
Kurtosis: -1.2
Mean: 163.5
Median Absolute Deviation (MAD): 81.5
Skewness: 0
Sum: 53301
Variance: 8883.5
Monotonicity: Strictly increasing
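
As a cross-check, the figures above can be recomputed with Python's `statistics` module. The sketch assumes the column is exactly the integers 1 to 326 (an inference from the reported minimum, maximum, and sum) and that the variance uses the sample (n - 1) denominator, which is the convention that matches 8883.5.

```python
import statistics

# Assumed column contents: the integers 1..326
values = list(range(1, 327))

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance, n - 1 denominator
std = statistics.stdev(values)          # square root of the sample variance
cv = std / mean                         # coefficient of variation
median = statistics.median(values)
mad = statistics.median([abs(v - median) for v in values])  # median absolute deviation

print(total, mean, variance, round(std, 6), round(cv, 8), mad)
```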
Histogram with fixed size bins (bins=50)
Value               Count  Frequency (%)
1                   1      0.3%
206                 1      0.3%
224                 1      0.3%
223                 1      0.3%
222                 1      0.3%
221                 1      0.3%
220                 1      0.3%
219                 1      0.3%
218                 1      0.3%
217                 1      0.3%
Other values (316)  316    96.9%

Value  Count  Frequency (%)
1      1      0.3%
2      1      0.3%
3      1      0.3%
4      1      0.3%
5      1      0.3%
6      1      0.3%
7      1      0.3%
8      1      0.3%
9      1      0.3%
10     1      0.3%

Value  Count  Frequency (%)
326    1      0.3%
325    1      0.3%
324    1      0.3%
323    1      0.3%
322    1      0.3%
321    1      0.3%
320    1      0.3%
319    1      0.3%
318    1      0.3%
317    1      0.3%

등록일자
Date

Distinct: 290
Distinct (%): 89.0%
Missing: 0
Missing (%): 0.0%
Memory size: 2.7 KiB
Minimum: 2007-03-28 00:00:00
Maximum: 2022-11-01 00:00:00
Histogram with fixed size bins (bins=50)

소유구분
Categorical

IMBALANCE 

Distinct: 3
Distinct (%): 0.9%
Missing: 0
Missing (%): 0.0%
Memory size: 2.7 KiB
개인: 310
기타: 8
법인: 8

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 개인
2nd row: 개인
3rd row: 개인
4th row: 개인
5th row: 개인

Common Values

Value  Count  Frequency (%)
개인   310    95.1%
기타   8      2.5%
법인   8      2.5%

Length

Histogram of lengths of the category

Common Values (Plot)

Value  Count  Frequency (%)
개인   310    95.1%
기타   8      2.5%
법인   8      2.5%

기구종류
Categorical

Distinct: 5
Distinct (%): 1.5%
Missing: 0
Missing (%): 0.0%
Memory size: 2.7 KiB
모터보트(선외기): 237
수상오토바이: 42
모터보트(선내기): 31
세일링요트: 8
고무보트: 8

Length

Max length: 9
Median length: 9
Mean length: 8.392638
Min length: 4

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 모터보트(선외기)
2nd row: 모터보트(선외기)
3rd row: 모터보트(선내기)
4th row: 모터보트(선외기)
5th row: 모터보트(선외기)

Common Values

Value             Count  Frequency (%)
모터보트(선외기)  237    72.7%
수상오토바이      42     12.9%
모터보트(선내기)  31     9.5%
세일링요트        8      2.5%
고무보트          8      2.5%

Length

Histogram of lengths of the category

Common Values (Plot)

Value             Count  Frequency (%)
모터보트(선외기)  237    72.7%
수상오토바이      42     12.9%
모터보트(선내기)  31     9.5%
세일링요트        8      2.5%
고무보트          8      2.5%

기구명
Text

Distinct: 316
Distinct (%): 96.9%
Missing: 0
Missing (%): 0.0%
Memory size: 2.7 KiB

Length

Max length: 14
Median length: 12
Mean length: 3.8680982
Min length: 2

Characters and Unicode

Total characters: 1261
Distinct characters: 271
Distinct categories: 8
Distinct scripts: 3
Distinct blocks: 3
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
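
To make the category breakdown concrete, the sketch below tallies Unicode general categories with the standard library's `unicodedata` module. The two-letter codes map onto the labels used in this report: `Lo` is Other Letter, `Lu` Uppercase Letter, `Ll` Lowercase Letter, `Nd` Decimal Number, and `Zs` Space Separator. The helper name `category_counts` is illustrative, and the input is only a handful of craft names taken from this report's sample rows, not the full column.

```python
import unicodedata
from collections import Counter

def category_counts(names):
    """Count Unicode general categories over every character in the given names."""
    return Counter(unicodedata.category(ch) for name in names for ch in name)

# A few craft names that appear in this report's sample rows
sample_names = ["서영호", "moon", "남해3호", "HOYA KINGDOM"]

counts = category_counts(sample_names)
for category, count in counts.most_common():
    print(category, count)
```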

Unique

Unique: 307
Unique (%): 94.2%

Sample

1st row: 서영호
2nd row: moon
3rd row: 봉진
4th row: 창성호
5th row: 세준호

Value               Count  Frequency (%)
비너스              3      0.9%
행운호              2      0.6%
남해3호             2      0.6%
힐링호              2      0.6%
청사초롱호          2      0.6%
돌핀호              2      0.6%
갈매기호            2      0.6%
주리호              2      0.6%
해양호              2      0.6%
서영호              1      0.3%
Other values (316)  316    94.0%

Most occurring characters

ValueCountFrequency (%)
201
 
15.9%
34
 
2.7%
2 27
 
2.1%
24
 
1.9%
1 24
 
1.9%
21
 
1.7%
18
 
1.4%
3 17
 
1.3%
16
 
1.3%
16
 
1.3%
Other values (261) 863
68.4%

Most occurring categories

Value              Count  Frequency (%)
Other Letter       984    78.0%
Decimal Number     102    8.1%
Uppercase Letter   99     7.9%
Lowercase Letter   58     4.6%
Space Separator    10     0.8%
Dash Punctuation   5      0.4%
Other Punctuation  2      0.2%
Letter Number      1      0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
201
 
20.4%
34
 
3.5%
24
 
2.4%
21
 
2.1%
18
 
1.8%
16
 
1.6%
16
 
1.6%
14
 
1.4%
14
 
1.4%
13
 
1.3%
Other values (205) 613
62.3%
Uppercase Letter
ValueCountFrequency (%)
I 12
 
12.1%
N 9
 
9.1%
A 9
 
9.1%
M 7
 
7.1%
O 6
 
6.1%
E 6
 
6.1%
R 6
 
6.1%
J 5
 
5.1%
S 5
 
5.1%
D 4
 
4.0%
Other values (14) 30
30.3%
Lowercase Letter
ValueCountFrequency (%)
n 7
12.1%
o 7
12.1%
e 7
12.1%
i 6
10.3%
y 5
8.6%
r 4
 
6.9%
l 4
 
6.9%
t 3
 
5.2%
m 2
 
3.4%
d 2
 
3.4%
Other values (8) 11
19.0%
Decimal Number
ValueCountFrequency (%)
2 27
26.5%
1 24
23.5%
3 17
16.7%
5 8
 
7.8%
6 6
 
5.9%
7 6
 
5.9%
0 5
 
4.9%
4 5
 
4.9%
9 3
 
2.9%
8 1
 
1.0%
Space Separator
ValueCountFrequency (%)
10
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 5
100.0%
Other Punctuation
ValueCountFrequency (%)
. 2
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

Value   Count  Frequency (%)
Hangul  984    78.0%
Latin   158    12.5%
Common  119    9.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
201
 
20.4%
34
 
3.5%
24
 
2.4%
21
 
2.1%
18
 
1.8%
16
 
1.6%
16
 
1.6%
14
 
1.4%
14
 
1.4%
13
 
1.3%
Other values (205) 613
62.3%
Latin
ValueCountFrequency (%)
I 12
 
7.6%
N 9
 
5.7%
A 9
 
5.7%
n 7
 
4.4%
o 7
 
4.4%
e 7
 
4.4%
M 7
 
4.4%
O 6
 
3.8%
i 6
 
3.8%
E 6
 
3.8%
Other values (33) 82
51.9%
Common
ValueCountFrequency (%)
2 27
22.7%
1 24
20.2%
3 17
14.3%
10
 
8.4%
5 8
 
6.7%
6 6
 
5.0%
7 6
 
5.0%
0 5
 
4.2%
- 5
 
4.2%
4 5
 
4.2%
Other values (3) 6
 
5.0%

Most occurring blocks

Value         Count  Frequency (%)
Hangul        984    78.0%
ASCII         276    21.9%
Number Forms  1      0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
201
 
20.4%
34
 
3.5%
24
 
2.4%
21
 
2.1%
18
 
1.8%
16
 
1.6%
16
 
1.6%
14
 
1.4%
14
 
1.4%
13
 
1.3%
Other values (205) 613
62.3%
ASCII
ValueCountFrequency (%)
2 27
 
9.8%
1 24
 
8.7%
3 17
 
6.2%
I 12
 
4.3%
10
 
3.6%
N 9
 
3.3%
A 9
 
3.3%
5 8
 
2.9%
n 7
 
2.5%
o 7
 
2.5%
Other values (45) 146
52.9%
Number Forms
ValueCountFrequency (%)
1
100.0%

총톤수
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct: 154
Distinct (%): 47.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.2463497
Minimum: 0
Maximum: 19
Zeros: 47
Zeros (%): 14.4%
Negative: 0
Negative (%): 0.0%
Memory size: 3.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0.48
Median: 0.82
Q3: 1.3375
95-th percentile: 3.6
Maximum: 19
Range: 19
Interquartile range (IQR): 0.8575

Descriptive statistics

Standard deviation: 1.9023448
Coefficient of variation (CV): 1.5263331
Kurtosis: 48.345367
Mean: 1.2463497
Median Absolute Deviation (MAD): 0.42
Skewness: 5.9967873
Sum: 406.31
Variance: 3.6189156
Monotonicity: Not monotonic
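
Several of the 총톤수 figures are derived from others in this report: the mean is the sum divided by the number of observations, the CV is the standard deviation divided by the mean, and the IQR is Q3 minus Q1. The sketch below redoes that arithmetic from the reported values. It also computes an upper Tukey fence (Q3 + 1.5 × IQR) as a rough way to read the long right tail; that fence is an illustrative addition and not something the profiler reports.

```python
# Summary values copied from this report for 총톤수
n = 326
total = 406.31
std = 1.9023448
q1, q3 = 0.48, 1.3375

mean = total / n   # reported mean: 1.2463497
cv = std / mean    # reported CV: 1.5263331
iqr = q3 - q1      # reported IQR: 0.8575

# Illustrative only: values above this fence are conventionally treated as outliers
upper_fence = q3 + 1.5 * iqr

print(round(mean, 7), round(cv, 7), round(iqr, 4), round(upper_fence, 5))
```

With a reported maximum of 19 and a 95th percentile of 3.6, the largest registered craft sit well above such a fence, which fits the high skewness and kurtosis shown above.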
Histogram with fixed size bins (bins=50)
Value               Count  Frequency (%)
0.0                 47     14.4%
0.77                8      2.5%
0.71                7      2.1%
0.68                6      1.8%
0.8                 6      1.8%
1.11                5      1.5%
0.53                5      1.5%
0.61                5      1.5%
1.0                 5      1.5%
0.89                5      1.5%
Other values (144)  227    69.6%

Value  Count  Frequency (%)
0.0    47     14.4%
0.14   1      0.3%
0.16   2      0.6%
0.25   1      0.3%
0.26   3      0.9%
0.27   1      0.3%
0.28   2      0.6%
0.29   1      0.3%
0.3    1      0.3%
0.31   3      0.9%

Value  Count  Frequency (%)
19.0   2      0.6%
11.0   1      0.3%
7.8    1      0.3%
7.31   1      0.3%
6.67   1      0.3%
6.63   1      0.3%
6.0    1      0.3%
5.72   1      0.3%
4.99   1      0.3%
4.82   1      0.3%

승선정원
Real number (ℝ)

HIGH CORRELATION 

Distinct: 14
Distinct (%): 4.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 5.4785276
Minimum: 1
Maximum: 27
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 3.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 2
Q1: 3
Median: 5
Q3: 6
95-th percentile: 12
Maximum: 27
Range: 26
Interquartile range (IQR): 3

Descriptive statistics

Standard deviation: 3.403776
Coefficient of variation (CV): 0.62129395
Kurtosis: 12.03678
Mean: 5.4785276
Median Absolute Deviation (MAD): 1
Skewness: 2.7335069
Sum: 1786
Variance: 11.585691
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=14)
Value             Count  Frequency (%)
3                 70     21.5%
4                 62     19.0%
5                 56     17.2%
6                 46     14.1%
12                22     6.7%
2                 20     6.1%
8                 18     5.5%
10                14     4.3%
7                 8      2.5%
1                 3      0.9%
Other values (4)  7      2.1%

Value  Count  Frequency (%)
1      3      0.9%
2      20     6.1%
3      70     21.5%
4      62     19.0%
5      56     17.2%
6      46     14.1%
7      8      2.5%
8      18     5.5%
9      3      0.9%
10     14     4.3%

Value  Count  Frequency (%)
27     2      0.6%
24     1      0.3%
20     1      0.3%
12     22     6.7%
10     14     4.3%
9      3      0.9%
8      18     5.5%
7      8      2.5%
6      46     14.1%
5      56     17.2%

데이터기준일자
Date

CONSTANT 

Distinct: 1
Distinct (%): 0.3%
Missing: 0
Missing (%): 0.0%
Memory size: 2.7 KiB
Minimum: 2022-11-07 00:00:00
Maximum: 2022-11-07 00:00:00
Histogram with fixed size bins (bins=1)

Interactions

[Figure: pairwise interaction plots, 9 panels]

Correlations

Variable      자료일련번호  소유구분  기구종류  총톤수  승선정원
자료일련번호  1.000         0.401     0.205     0.201   0.144
소유구분      0.401         1.000     0.478     0.336   0.218
기구종류      0.205         0.478     1.000     0.568   0.580
총톤수        0.201         0.336     0.568     1.000   0.763
승선정원      0.144         0.218     0.580     0.763   1.000
Variable  기구종류  소유구분
기구종류  1.000     0.410
소유구분  0.410     1.000
Variable      자료일련번호  총톤수  승선정원  소유구분  기구종류
자료일련번호  1.000         0.141   0.218     0.260     0.087
총톤수        0.141         1.000   0.760     0.238     0.406
승선정원      0.218         0.760   1.000     0.140     0.403
소유구분      0.260         0.238   0.140     1.000     0.410
기구종류      0.087         0.406   0.403     0.410     1.000
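
One way to read a matrix like the first one in this section is to list the variable pairs whose coefficient exceeds a cutoff. The sketch below does that for the five-variable matrix, using its values as given. The cutoff of 0.7 and the helper name `pairs_above` are assumptions chosen for illustration; this report does not state the threshold the profiler uses for its High correlation alert.

```python
labels = ["자료일련번호", "소유구분", "기구종류", "총톤수", "승선정원"]
matrix = [
    [1.000, 0.401, 0.205, 0.201, 0.144],
    [0.401, 1.000, 0.478, 0.336, 0.218],
    [0.205, 0.478, 1.000, 0.568, 0.580],
    [0.201, 0.336, 0.568, 1.000, 0.763],
    [0.144, 0.218, 0.580, 0.763, 1.000],
]

def pairs_above(labels, matrix, cutoff):
    """Return (a, b, value) for each pair in the upper triangle with |value| >= cutoff."""
    found = []
    for i in range(len(labels)):
        for j in range(i + 1, len(labels)):
            if abs(matrix[i][j]) >= cutoff:
                found.append((labels[i], labels[j], matrix[i][j]))
    return found

# Illustrative cutoff, not necessarily the profiler's setting
strong = pairs_above(labels, matrix, cutoff=0.7)
print(strong)
```

At this cutoff only the 총톤수 and 승선정원 pair is returned, which is the same pair named in the Alerts section.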

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

Index  자료일련번호  등록일자    소유구분  기구종류          기구명       총톤수  승선정원  데이터기준일자
0      1             2020-07-15  개인      모터보트(선외기)  서영호       0.25    3         2022-11-07
1      2             2022-11-01  개인      모터보트(선외기)  moon         0.37    3         2022-11-07
2      3             2022-10-24  개인      모터보트(선내기)  봉진         1.9     5         2022-11-07
3      4             2022-10-05  개인      모터보트(선외기)  창성호       0.77    4         2022-11-07
4      5             2022-09-27  개인      모터보트(선외기)  세준호       0.46    3         2022-11-07
5      6             2022-09-27  개인      모터보트(선외기)  에메랄드5호  0.0     4         2022-11-07
6      7             2022-09-05  개인      모터보트(선외기)  베스트호     0.43    4         2022-11-07
7      8             2022-08-11  기타      수상오토바이      남해1호      0.0     2         2022-11-07
8      9             2022-08-11  기타      수상오토바이      남해2호      0.0     2         2022-11-07
9      10            2022-08-11  기타      수상오토바이      남해3호      0.0     3         2022-11-07

자료일련번호등록일자소유구분기구종류기구명총톤수승선정원데이터기준일자
3163172017-06-26개인수상오토바이컨퀘스트2호0.032022-11-07
3173182016-11-23개인수상오토바이제우스1호0.022022-11-07
3183192018-04-11개인모터보트(선외기)워라밸0.3142022-11-07
3193202020-04-02개인수상오토바이혜빈0.032022-11-07
3203212019-11-07개인모터보트(선외기)해진0.7362022-11-07
3213222015-12-03개인모터보트(선외기)HOYA KINGDOM2.4122022-11-07
3223232015-11-09개인모터보트(선외기)마인드0.7952022-11-07
3233242007-04-02개인모터보트(선외기)목화호1.1352022-11-07
3243252021-09-07개인모터보트(선외기)아라미르0.7542022-11-07
3253262015-03-13개인모터보트(선외기)MERIL호1.3862022-11-07