Overview

Dataset statistics

Number of variables6
Number of observations75
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory3.9 KiB
Average record size in memory53.8 B

Variable types

Categorical3
Numeric3

Dataset

DescriptionNAPPO(북미식물보호기구)에서 요구하는 북미국가(미국, 캐나다 등), 칠레, 뉴질랜드 등 출항선박에 대한 선박 AGM 검사 통계정보
Author국제식물검역인증원
URLhttps://www.data.go.kr/data/15012980/fileData.do

Alerts

검사수수료(천원) is highly overall correlated with 선박 중량High correlation
건수(기본) is highly overall correlated with 건수(할증)High correlation
건수(할증) is highly overall correlated with 건수(기본)High correlation
선박 종류 is highly overall correlated with 선박 중량High correlation
선박 중량 is highly overall correlated with 검사수수료(천원) and 1 other fieldsHigh correlation
건수(기본) has 7 (9.3%) zerosZeros
건수(할증) has 6 (8.0%) zerosZeros

Reproduction

Analysis started2023-12-12 11:59:13.278321
Analysis finished2023-12-12 11:59:14.455625
Duration1.18 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

검사년도
Categorical

Distinct5
Distinct (%)6.7%
Missing0
Missing (%)0.0%
Memory size732.0 B
2017
15 
2016
15 
2015
15 
2014
15 
2013
15 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2017
2nd row2017
3rd row2017
4th row2017
5th row2017

Common Values

ValueCountFrequency (%)
2017 15
20.0%
2016 15
20.0%
2015 15
20.0%
2014 15
20.0%
2013 15
20.0%

Length

2023-12-12T20:59:14.552549image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:59:14.748825image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2017 15
20.0%
2016 15
20.0%
2015 15
20.0%
2014 15
20.0%
2013 15
20.0%

선박 종류
Categorical

HIGH CORRELATION 

Distinct4
Distinct (%)5.3%
Missing0
Missing (%)0.0%
Memory size732.0 B
컨테이너
25 
기타선박류
20 
벌크선
15 
차량운반선
15 

Length

Max length5
Median length4
Mean length4.2666667
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row벌크선
2nd row차량운반선
3rd row벌크선
4th row컨테이너
5th row컨테이너

Common Values

ValueCountFrequency (%)
컨테이너 25
33.3%
기타선박류 20
26.7%
벌크선 15
20.0%
차량운반선 15
20.0%

Length

2023-12-12T20:59:14.920863image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:59:15.114314image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
컨테이너 25
33.3%
기타선박류 20
26.7%
벌크선 15
20.0%
차량운반선 15
20.0%

선박 중량
Categorical

HIGH CORRELATION 

Distinct9
Distinct (%)12.0%
Missing0
Missing (%)0.0%
Memory size732.0 B
2.5만톤 미만
15 
2.5만~4만톤 미만
15 
4만톤 이상
10 
7만톤 이상
10 
2만톤 미만
Other values (4)
20 

Length

Max length11
Median length9
Mean length8.2
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2.5만톤 미만
2nd row4만톤 이상
3rd row4만톤 이상
4th row2만톤 미만
5th row2만~3만톤 미만

Common Values

ValueCountFrequency (%)
2.5만톤 미만 15
20.0%
2.5만~4만톤 미만 15
20.0%
4만톤 이상 10
13.3%
7만톤 이상 10
13.3%
2만톤 미만 5
 
6.7%
2만~3만톤 미만 5
 
6.7%
3만~5만톤 미만 5
 
6.7%
5만~7만톤 미만 5
 
6.7%
4만~7만톤 미만 5
 
6.7%

Length

2023-12-12T20:59:15.296083image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-12T20:59:15.472831image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
미만 55
36.7%
이상 20
 
13.3%
2.5만톤 15
 
10.0%
2.5만~4만톤 15
 
10.0%
4만톤 10
 
6.7%
7만톤 10
 
6.7%
2만톤 5
 
3.3%
2만~3만톤 5
 
3.3%
3만~5만톤 5
 
3.3%
5만~7만톤 5
 
3.3%

검사수수료(천원)
Real number (ℝ)

HIGH CORRELATION 

Distinct10
Distinct (%)13.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean996.13333
Minimum80
Maximum2250
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size807.0 B
2023-12-12T20:59:15.630916image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum80
5-th percentile120
Q1200
median1125
Q31500
95-th percentile2250
Maximum2250
Range2170
Interquartile range (IQR)1300

Descriptive statistics

Standard deviation761.39611
Coefficient of variation (CV)0.7643516
Kurtosis-1.4976804
Mean996.13333
Median Absolute Deviation (MAD)750
Skewness0.12988349
Sum74710
Variance579724.04
MonotonicityNot monotonic
2023-12-12T20:59:15.763694image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%)
1125 12
16.0%
1875 12
16.0%
1500 12
16.0%
120 8
10.7%
200 8
10.7%
160 8
10.7%
2250 6
8.0%
240 4
 
5.3%
750 3
 
4.0%
80 2
 
2.7%
ValueCountFrequency (%)
80 2
 
2.7%
120 8
10.7%
160 8
10.7%
200 8
10.7%
240 4
 
5.3%
750 3
 
4.0%
1125 12
16.0%
1500 12
16.0%
1875 12
16.0%
2250 6
8.0%
ValueCountFrequency (%)
2250 6
8.0%
1875 12
16.0%
1500 12
16.0%
1125 12
16.0%
750 3
 
4.0%
240 4
 
5.3%
200 8
10.7%
160 8
10.7%
120 8
10.7%
80 2
 
2.7%

건수(기본)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct56
Distinct (%)74.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean91.826667
Minimum0
Maximum272
Zeros7
Zeros (%)9.3%
Negative0
Negative (%)0.0%
Memory size807.0 B
2023-12-12T20:59:15.976526image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q17
median86
Q3144
95-th percentile253.1
Maximum272
Range272
Interquartile range (IQR)137

Descriptive statistics

Standard deviation80.569992
Coefficient of variation (CV)0.87741388
Kurtosis-0.64681309
Mean91.826667
Median Absolute Deviation (MAD)76
Skewness0.5491432
Sum6887
Variance6491.5236
MonotonicityNot monotonic
2023-12-12T20:59:16.170753image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0 7
 
9.3%
2 3
 
4.0%
7 3
 
4.0%
3 3
 
4.0%
6 2
 
2.7%
135 2
 
2.7%
122 2
 
2.7%
98 2
 
2.7%
1 2
 
2.7%
64 2
 
2.7%
Other values (46) 47
62.7%
ValueCountFrequency (%)
0 7
9.3%
1 2
 
2.7%
2 3
4.0%
3 3
4.0%
4 1
 
1.3%
6 2
 
2.7%
7 3
4.0%
10 1
 
1.3%
16 1
 
1.3%
22 1
 
1.3%
ValueCountFrequency (%)
272 1
1.3%
261 1
1.3%
260 1
1.3%
258 1
1.3%
251 1
1.3%
249 1
1.3%
232 1
1.3%
206 1
1.3%
189 1
1.3%
175 1
1.3%

건수(할증)
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct52
Distinct (%)69.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean78.626667
Minimum0
Maximum295
Zeros6
Zeros (%)8.0%
Negative0
Negative (%)0.0%
Memory size807.0 B
2023-12-12T20:59:16.377612image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q15.5
median52
Q3128.5
95-th percentile228.2
Maximum295
Range295
Interquartile range (IQR)123

Descriptive statistics

Standard deviation78.007064
Coefficient of variation (CV)0.99211968
Kurtosis0.061057379
Mean78.626667
Median Absolute Deviation (MAD)50
Skewness0.96032375
Sum5897
Variance6085.102
MonotonicityNot monotonic
2023-12-12T20:59:16.561787image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2 7
 
9.3%
0 6
 
8.0%
33 4
 
5.3%
3 3
 
4.0%
51 2
 
2.7%
31 2
 
2.7%
40 2
 
2.7%
44 2
 
2.7%
203 2
 
2.7%
1 2
 
2.7%
Other values (42) 43
57.3%
ValueCountFrequency (%)
0 6
8.0%
1 2
 
2.7%
2 7
9.3%
3 3
4.0%
4 1
 
1.3%
7 1
 
1.3%
9 1
 
1.3%
12 1
 
1.3%
27 1
 
1.3%
28 1
 
1.3%
ValueCountFrequency (%)
295 1
1.3%
271 1
1.3%
255 1
1.3%
252 1
1.3%
218 1
1.3%
214 1
1.3%
203 2
2.7%
202 1
1.3%
180 1
1.3%
175 1
1.3%

Interactions

2023-12-12T20:59:14.023770image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:59:13.505381image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:59:13.735404image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:59:14.093325image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:59:13.576835image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:59:13.853973image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:59:14.165199image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:59:13.658810image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-12T20:59:13.934308image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-12T20:59:16.698949image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
검사년도선박 종류선박 중량검사수수료(천원)건수(기본)건수(할증)
검사년도1.0000.0000.0000.5830.0000.000
선박 종류0.0001.0000.7380.0000.6520.667
선박 중량0.0000.7381.0000.8640.6290.492
검사수수료(천원)0.5830.0000.8641.0000.5010.354
건수(기본)0.0000.6520.6290.5011.0000.911
건수(할증)0.0000.6670.4920.3540.9111.000
2023-12-12T20:59:16.821158image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
검사년도선박 종류선박 중량
검사년도1.0000.0000.000
선박 종류0.0001.0000.558
선박 중량0.0000.5581.000
2023-12-12T20:59:16.974875image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
검사수수료(천원)건수(기본)건수(할증)검사년도선박 종류선박 중량
검사수수료(천원)1.0000.3140.3130.4390.0000.632
건수(기본)0.3141.0000.9320.0000.4320.343
건수(할증)0.3130.9321.0000.0000.4460.244
검사년도0.4390.0000.0001.0000.0000.000
선박 종류0.0000.4320.4460.0001.0000.558
선박 중량0.6320.3430.2440.0000.5581.000

Missing values

2023-12-12T20:59:14.277316image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-12T20:59:14.403161image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

검사년도선박 종류선박 중량검사수수료(천원)건수(기본)건수(할증)
02017벌크선2.5만톤 미만1125189125
12017차량운반선4만톤 이상187513651
22017벌크선4만톤 이상1875163218
32017컨테이너2만톤 미만75021
42017컨테이너2만~3만톤 미만112570
52017컨테이너3만~5만톤 미만15004033
62017컨테이너5만~7만톤 미만18759092
72017컨테이너7만톤 이상2250272214
82017기타선박류2.5만톤 미만11256453
92017기타선박류2.5만~4만톤 미만150014690
검사년도선박 종류선박 중량검사수수료(천원)건수(기본)건수(할증)
652013기타선박류2.5만~4만톤 미만1605533
662013기타선박류2.5만톤 미만1204657
672013컨테이너7만톤 이상24013599
682013컨테이너5만~7만톤 미만20013485
692013컨테이너3만~5만톤 미만1608060
702013컨테이너2만~3만톤 미만12072
712013컨테이너2만톤 미만8003
722013벌크선4만톤 이상200122150
732013벌크선2.5만톤 미만120174160
742013벌크선2.5만~4만톤 미만160261203