Overview

Dataset statistics

Number of variables5
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.4 KiB
Average record size in memory45.3 B

Variable types

Numeric2
Categorical3

Alerts

측정소코드 has constant value ""Constant
측정항목코드 has constant value ""Constant
측정수치 is highly overall correlated with 측정데이터시간구분High correlation
측정데이터시간구분 is highly overall correlated with 측정수치High correlation
측정수치 has unique valuesUnique

Reproduction

Analysis started2023-12-10 12:05:19.817934
Analysis finished2023-12-10 12:05:21.354987
Duration1.54 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

날짜
Real number (ℝ)

Distinct34
Distinct (%)34.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0200402 × 1013
Minimum2.0200401 × 1013
Maximum2.0200403 × 1013
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T21:05:21.483139image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2.0200401 × 1013
5-th percentile2.0200401 × 1013
Q12.0200401 × 1013
median2.0200402 × 1013
Q32.0200403 × 1013
95-th percentile2.0200403 × 1013
Maximum2.0200403 × 1013
Range2180000
Interquartile range (IQR)1845000

Descriptive statistics

Standard deviation792646.28
Coefficient of variation (CV)3.9239134 × 10-8
Kurtosis-1.4109434
Mean2.0200402 × 1013
Median Absolute Deviation (MAD)920000
Skewness0.12289008
Sum2.0200402 × 1015
Variance6.2828812 × 1011
MonotonicityIncreasing
2023-12-10T21:05:21.744786image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=34)
ValueCountFrequency (%)
20200401000000 3
 
3.0%
20200403020000 3
 
3.0%
20200402140000 3
 
3.0%
20200402160000 3
 
3.0%
20200402180000 3
 
3.0%
20200402200000 3
 
3.0%
20200402220000 3
 
3.0%
20200403000000 3
 
3.0%
20200403040000 3
 
3.0%
20200401020000 3
 
3.0%
Other values (24) 70
70.0%
ValueCountFrequency (%)
20200401000000 3
3.0%
20200401020000 3
3.0%
20200401040000 3
3.0%
20200401060000 3
3.0%
20200401080000 3
3.0%
20200401100000 3
3.0%
20200401120000 3
3.0%
20200401140000 3
3.0%
20200401160000 3
3.0%
20200401180000 3
3.0%
ValueCountFrequency (%)
20200403180000 1
 
1.0%
20200403160000 3
3.0%
20200403140000 3
3.0%
20200403120000 3
3.0%
20200403100000 3
3.0%
20200403080000 3
3.0%
20200403060000 3
3.0%
20200403040000 3
3.0%
20200403020000 3
3.0%
20200403000000 3
3.0%

측정소코드
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
1
100 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row1
3rd row1
4th row1
5th row1

Common Values

ValueCountFrequency (%)
1 100
100.0%

Length

2023-12-10T21:05:21.976262image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T21:05:22.132180image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 100
100.0%

측정항목코드
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
90319
100 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row90319
2nd row90319
3rd row90319
4th row90319
5th row90319

Common Values

ValueCountFrequency (%)
90319 100
100.0%

Length

2023-12-10T21:05:22.310626image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T21:05:22.475914image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
90319 100
100.0%

측정데이터시간구분
Categorical

HIGH CORRELATION 

Distinct3
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
RH02
34 
RH24
33 
RY01
33 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowRH02
2nd rowRH24
3rd rowRY01
4th rowRH02
5th rowRH24

Common Values

ValueCountFrequency (%)
RH02 34
34.0%
RH24 33
33.0%
RY01 33
33.0%

Length

2023-12-10T21:05:22.651717image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T21:05:22.894370image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
rh02 34
34.0%
rh24 33
33.0%
ry01 33
33.0%

측정수치
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean77.675378
Minimum0
Maximum206.4552
Zeros1
Zeros (%)1.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T21:05:23.274806image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile16.545465
Q158.4908
median58.6479
Q3102.04995
95-th percentile147.15461
Maximum206.4552
Range206.4552
Interquartile range (IQR)43.55915

Descriptive statistics

Standard deviation41.750197
Coefficient of variation (CV)0.5374959
Kurtosis1.0090573
Mean77.675378
Median Absolute Deviation (MAD)11.23275
Skewness1.0516283
Sum7767.5378
Variance1743.079
MonotonicityNot monotonic
2023-12-10T21:05:23.593010image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
85.2143 1
 
1.0%
42.8416 1
 
1.0%
58.6313 1
 
1.0%
47.9103 1
 
1.0%
58.0972 1
 
1.0%
58.6277 1
 
1.0%
44.2145 1
 
1.0%
63.1916 1
 
1.0%
58.6237 1
 
1.0%
40.7411 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
0.0 1
1.0%
10.9231 1
1.0%
12.1034 1
1.0%
12.2712 1
1.0%
13.748 1
1.0%
16.6927 1
1.0%
21.5101 1
1.0%
32.044 1
1.0%
40.7411 1
1.0%
42.8416 1
1.0%
ValueCountFrequency (%)
206.4552 1
1.0%
200.1142 1
1.0%
186.3634 1
1.0%
185.4272 1
1.0%
168.1403 1
1.0%
146.0501 1
1.0%
144.5484 1
1.0%
144.5167 1
1.0%
143.9657 1
1.0%
141.4145 1
1.0%

Interactions

2023-12-10T21:05:20.366685image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:05:20.033574image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:05:20.921920image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:05:20.209569image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T21:05:23.759903image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
날짜측정데이터시간구분측정수치
날짜1.0000.0000.715
측정데이터시간구분0.0001.0000.729
측정수치0.7150.7291.000
2023-12-10T21:05:23.902195image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
날짜측정수치측정데이터시간구분
날짜1.000-0.2550.000
측정수치-0.2551.0000.571
측정데이터시간구분0.0000.5711.000

Missing values

2023-12-10T21:05:21.120403image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T21:05:21.290622image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

날짜측정소코드측정항목코드측정데이터시간구분측정수치
020200401000000190319RH0285.2143
120200401000000190319RH24101.1921
220200401000000190319RY0158.3686
320200401020000190319RH0269.3755
420200401020000190319RH24101.8384
520200401020000190319RY0158.379
620200401040000190319RH02105.3637
720200401040000190319RH24106.1207
820200401040000190319RY0158.3979
920200401060000190319RH02144.5167
날짜측정소코드측정항목코드측정데이터시간구분측정수치
9020200403120000190319RH02117.6346
9120200403120000190319RH2464.8593
9220200403120000190319RY0158.652
9320200403140000190319RH0298.7796
9420200403140000190319RH2466.7581
9520200403140000190319RY0158.6571
9620200403160000190319RH02102.6846
9720200403160000190319RH2470.5714
9820200403160000190319RY0158.6654
9920200403180000190319RH02107.5769