Overview

Dataset statistics

Number of variables5
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.4 KiB
Average record size in memory45.3 B

Variable types

Numeric2
Categorical3

Alerts

측정소코드 has constant value ""Constant
측정항목코드 has constant value ""Constant
측정수치 is highly overall correlated with 측정데이터시간구분High correlation
측정데이터시간구분 is highly overall correlated with 측정수치High correlation
측정수치 has unique valuesUnique

Reproduction

Analysis started2023-12-10 12:05:24.418501
Analysis finished2023-12-10 12:05:25.331342
Duration0.91 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

날짜
Real number (ℝ)

Distinct28
Distinct (%)28.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0200302 × 1013
Minimum2.0200301 × 1013
Maximum2.0200303 × 1013
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T21:05:25.423766image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2.0200301 × 1013
5-th percentile2.0200301 × 1013
Q12.0200301 × 1013
median2.0200302 × 1013
Q32.0200302 × 1013
95-th percentile2.0200303 × 1013
Maximum2.0200303 × 1013
Range2060000
Interquartile range (IQR)1045000

Descriptive statistics

Standard deviation703826.51
Coefficient of variation (CV)3.4842376 × 10-8
Kurtosis-1.1627605
Mean2.0200302 × 1013
Median Absolute Deviation (MAD)880000
Skewness0.1411157
Sum2.0200302 × 1015
Variance4.9537176 × 1011
MonotonicityIncreasing
2023-12-10T21:05:25.620931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=28)
ValueCountFrequency (%)
20200303040000 6
 
6.0%
20200303020000 6
 
6.0%
20200303000000 6
 
6.0%
20200302220000 6
 
6.0%
20200302200000 6
 
6.0%
20200302180000 6
 
6.0%
20200301000000 3
 
3.0%
20200301020000 3
 
3.0%
20200302160000 3
 
3.0%
20200302140000 3
 
3.0%
Other values (18) 52
52.0%
ValueCountFrequency (%)
20200301000000 3
3.0%
20200301020000 3
3.0%
20200301040000 3
3.0%
20200301060000 3
3.0%
20200301080000 3
3.0%
20200301100000 3
3.0%
20200301120000 3
3.0%
20200301140000 3
3.0%
20200301160000 3
3.0%
20200301180000 3
3.0%
ValueCountFrequency (%)
20200303060000 1
 
1.0%
20200303040000 6
6.0%
20200303020000 6
6.0%
20200303000000 6
6.0%
20200302220000 6
6.0%
20200302200000 6
6.0%
20200302180000 6
6.0%
20200302160000 3
3.0%
20200302140000 3
3.0%
20200302120000 3
3.0%

측정소코드
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
1
100 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 100
100.0%

Length

2023-12-10T21:05:25.850810image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T21:05:26.011429image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 100
100.0%

측정항목코드
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
90319
100 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row90319
2nd row90319
3rd row90319
4th row90319
5th row90319

Common Values

ValueCountFrequency (%)
90319 100
100.0%

Length

2023-12-10T21:05:26.242598image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T21:05:26.528694image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
90319 100
100.0%

측정데이터시간구분
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
RH02
34 
RH24
33 
RY01
33 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowRH02
2nd rowRH24
3rd rowRY01
4th rowRH02
5th rowRH24

Common Values

ValueCountFrequency (%)
RH02 34
34.0%
RH24 33
33.0%
RY01 33
33.0%

Length

2023-12-10T21:05:26.705502image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T21:05:26.920686image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
rh02 34
34.0%
rh24 33
33.0%
ry01 33
33.0%

측정수치
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean39.859637
Minimum9.8935
Maximum59.9626
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T21:05:27.169415image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum9.8935
5-th percentile19.537925
Q129.315075
median36.28225
Q356.77435
95-th percentile59.84342
Maximum59.9626
Range50.0691
Interquartile range (IQR)27.459275

Descriptive statistics

Standard deviation14.392158
Coefficient of variation (CV)0.36107098
Kurtosis-1.2751242
Mean39.859637
Median Absolute Deviation (MAD)11.3757
Skewness0.022425893
Sum3985.9637
Variance207.13422
MonotonicityNot monotonic
2023-12-10T21:05:27.465697image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
24.296 1
 
1.0%
41.097 1
 
1.0%
56.6738 1
 
1.0%
59.932 1
 
1.0%
28.654 1
 
1.0%
37.6654 1
 
1.0%
35.506 1
 
1.0%
44.7423 1
 
1.0%
56.7136 1
 
1.0%
59.9626 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
9.8935 1
1.0%
10.6755 1
1.0%
12.6775 1
1.0%
18.5975 1
1.0%
18.786 1
1.0%
19.5775 1
1.0%
19.5865 1
1.0%
19.854 1
1.0%
20.732 1
1.0%
21.646 1
1.0%
ValueCountFrequency (%)
59.9626 1
1.0%
59.932 1
1.0%
59.9037 1
1.0%
59.8827 1
1.0%
59.8628 1
1.0%
59.8424 1
1.0%
57.3839 1
1.0%
57.3691 1
1.0%
57.3565 1
1.0%
57.3438 1
1.0%

Interactions

2023-12-10T21:05:24.860727image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:05:24.562675image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:05:24.990227image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:05:24.678949image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T21:05:27.640235image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
날짜측정데이터시간구분측정수치
날짜1.0000.0000.470
측정데이터시간구분0.0001.0000.877
측정수치0.4700.8771.000
2023-12-10T21:05:27.797122image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
날짜측정수치측정데이터시간구분
날짜1.0000.0170.000
측정수치0.0171.0000.783
측정데이터시간구분0.0000.7831.000

Missing values

2023-12-10T21:05:25.149644image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T21:05:25.272756image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

날짜측정소코드측정항목코드측정데이터시간구분측정수치
020200301000000190319RH0224.296
120200301000000190319RH2422.2481
220200301000000190319RY0157.3839
320200301020000190319RH0219.854
420200301020000190319RH2423.8895
520200301020000190319RY0157.3691
620200301040000190319RH0220.732
720200301040000190319RH2425.5171
820200301040000190319RY0157.3565
920200301060000190319RH0221.646
날짜측정소코드측정항목코드측정데이터시간구분측정수치
9020200303020000190319RH2430.6302
9120200303020000190319RY0159.8628
9220200303020000190319RY0156.5939
9320200303040000190319RH0232.7802
9420200303040000190319RH0222.478
9520200303040000190319RH2439.8463
9620200303040000190319RH2430.8712
9720200303040000190319RY0159.8424
9820200303040000190319RY0156.5719
9920200303060000190319RH0233.915