Overview

Dataset statistics

Number of variables3
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory332.0 KiB
Average record size in memory34.0 B

Variable types

Numeric2
Categorical1

Dataset

DescriptionSample
Author㈜전략해양
URLhttps://www.bigdata-coast.kr/gdsInfo/gdsInfoDetail.do?gdsCd=CT01RNS006

Alerts

GOCI2_VIDO_LO is highly overall correlated with PHAEOPHYTA_FQ_RTHigh correlation
PHAEOPHYTA_FQ_RT is highly overall correlated with GOCI2_VIDO_LOHigh correlation

Reproduction

Analysis started2024-03-13 12:43:51.849580
Analysis finished2024-03-13 12:43:53.344231
Duration1.49 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

GOCI2_VIDO_LO
Real number (ℝ)

HIGH CORRELATION 

Distinct9925
Distinct (%)99.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean123.78543
Minimum117.7732
Maximum130.25516
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-13T21:43:53.448869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum117.7732
5-th percentile118.95373
Q1121.51538
median123.46027
Q3125.19999
95-th percentile129.95196
Maximum130.25516
Range12.481956
Interquartile range (IQR)3.6846066

Descriptive statistics

Standard deviation3.3537528
Coefficient of variation (CV)0.027093275
Kurtosis-0.56663291
Mean123.78543
Median Absolute Deviation (MAD)1.834816
Skewness0.47172166
Sum1237854.3
Variance11.247658
MonotonicityNot monotonic
2024-03-13T21:43:53.646290image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
119.2479705811 3
 
< 0.1%
129.3799591064 2
 
< 0.1%
122.3508834839 2
 
< 0.1%
123.6556549072 2
 
< 0.1%
122.982749939 2
 
< 0.1%
129.8631134033 2
 
< 0.1%
121.8403320313 2
 
< 0.1%
124.3158111572 2
 
< 0.1%
123.4415206909 2
 
< 0.1%
122.8716049194 2
 
< 0.1%
Other values (9915) 9979
99.8%
ValueCountFrequency (%)
117.7732009888 1
< 0.1%
117.7810668945 1
< 0.1%
117.7864761353 1
< 0.1%
117.7899169922 1
< 0.1%
117.7923202515 1
< 0.1%
117.7933578491 1
< 0.1%
117.8012161255 1
< 0.1%
117.8078689575 1
< 0.1%
117.8100814819 1
< 0.1%
117.816696167 1
< 0.1%
ValueCountFrequency (%)
130.2551574707 1
< 0.1%
130.2466125488 1
< 0.1%
130.2461853027 1
< 0.1%
130.2431793213 1
< 0.1%
130.2410430908 1
< 0.1%
130.2393341064 1
< 0.1%
130.2363739014 1
< 0.1%
130.2361297607 1
< 0.1%
130.2359466553 1
< 0.1%
130.2353057861 1
< 0.1%

GOCI2_VIDO_LA
Real number (ℝ)

Distinct9932
Distinct (%)99.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean37.836972
Minimum36.363586
Maximum39.051601
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-03-13T21:43:53.891508image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum36.363586
5-th percentile36.515955
Q137.200825
median37.954025
Q338.50368
95-th percentile38.897019
Maximum39.051601
Range2.688015
Interquartile range (IQR)1.3028555

Descriptive statistics

Standard deviation0.76880516
Coefficient of variation (CV)0.020318887
Kurtosis-1.1211237
Mean37.836972
Median Absolute Deviation (MAD)0.61734962
Skewness-0.32045111
Sum378369.72
Variance0.59106137
MonotonicityNot monotonic
2024-03-13T21:43:54.090348image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
38.480342865 3
 
< 0.1%
36.4024009705 2
 
< 0.1%
37.175491333 2
 
< 0.1%
36.6164169312 2
 
< 0.1%
37.1913719177 2
 
< 0.1%
36.6429862976 2
 
< 0.1%
38.151714325 2
 
< 0.1%
38.0961456299 2
 
< 0.1%
38.8119163513 2
 
< 0.1%
38.5844154358 2
 
< 0.1%
Other values (9922) 9979
99.8%
ValueCountFrequency (%)
36.3635864258 1
< 0.1%
36.3648910522 1
< 0.1%
36.3649635315 1
< 0.1%
36.3650398254 1
< 0.1%
36.3658103943 1
< 0.1%
36.3659706116 1
< 0.1%
36.3666114807 1
< 0.1%
36.3690109253 1
< 0.1%
36.3702087402 1
< 0.1%
36.370300293 1
< 0.1%
ValueCountFrequency (%)
39.0516014099 1
< 0.1%
39.0513420105 1
< 0.1%
39.0503158569 1
< 0.1%
39.0465049744 1
< 0.1%
39.046257019 1
< 0.1%
39.0442581177 1
< 0.1%
39.043762207 1
< 0.1%
39.0425300598 1
< 0.1%
39.0407752991 1
< 0.1%
39.0399894714 1
< 0.1%

PHAEOPHYTA_FQ_RT
Categorical

HIGH CORRELATION 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
NaN
7105 
0.0000
2895 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row NaN
2nd row NaN
3rd row NaN
4th row0.0000
5th row NaN

Common Values

ValueCountFrequency (%)
NaN 7105
71.0%
0.0000 2895
28.9%

Length

2024-03-13T21:43:54.275147image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-13T21:43:54.410801image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
nan 7105
71.0%
0.0000 2895
28.9%

Interactions

2024-03-13T21:43:52.480981image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T21:43:52.174121image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T21:43:52.622110image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-03-13T21:43:52.324306image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-03-13T21:43:54.500931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
GOCI2_VIDO_LOGOCI2_VIDO_LAPHAEOPHYTA_FQ_RT
GOCI2_VIDO_LO1.0000.4580.640
GOCI2_VIDO_LA0.4581.0000.627
PHAEOPHYTA_FQ_RT0.6400.6271.000
2024-03-13T21:43:54.681237image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
GOCI2_VIDO_LOGOCI2_VIDO_LAPHAEOPHYTA_FQ_RT
GOCI2_VIDO_LO1.000-0.3160.647
GOCI2_VIDO_LA-0.3161.0000.486
PHAEOPHYTA_FQ_RT0.6470.4861.000

Missing values

2024-03-13T21:43:53.193823image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2024-03-13T21:43:53.288840image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

GOCI2_VIDO_LOGOCI2_VIDO_LAPHAEOPHYTA_FQ_RT
61826122.53094537.648392NaN
48019121.32668338.030941NaN
39247119.65264938.257313NaN
82101124.54583736.9433020.0000
26813123.92270738.445343NaN
25569129.8290138.4537730.0000
88779123.918836.693134NaN
53933119.96289837.910294NaN
77995129.78385937.0980870.0000
1134130.22648638.947151NaN
GOCI2_VIDO_LOGOCI2_VIDO_LAPHAEOPHYTA_FQ_RT
53812129.57748437.8486670.0000
46309121.10650638.073765NaN
10836122.44321438.767513NaN
46917121.62351238.052624NaN
65379129.67648337.523750.0000
33038124.1696438.324177NaN
81668123.33934836.967518NaN
52447121.25408237.927353NaN
23497117.83905838.600357NaN
2635123.51593838.927837NaN