Overview

Dataset statistics

Number of variables7
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory6.1 KiB
Average record size in memory62.3 B

Variable types

Numeric2
Categorical5

Alerts

측정일 has constant value ""Constant
강수량(mm) has constant value ""Constant
강우량(mm) has constant value ""Constant
기본키 is highly overall correlated with 측정시간 and 2 other fieldsHigh correlation
측정시간 is highly overall correlated with 기본키High correlation
지점 is highly overall correlated with 기본키 and 1 other fieldsHigh correlation
주소 is highly overall correlated with 기본키 and 1 other fieldsHigh correlation
지점 is highly imbalanced (71.4%)Imbalance
주소 is highly imbalanced (71.4%)Imbalance
기본키 has unique valuesUnique

Reproduction

Analysis started2023-12-10 13:42:50.475050
Analysis finished2023-12-10 13:42:51.640699
Duration1.17 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

기본키
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean50.5
Minimum1
Maximum100
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T22:42:51.767919image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.95
Q125.75
median50.5
Q375.25
95-th percentile95.05
Maximum100
Range99
Interquartile range (IQR)49.5

Descriptive statistics

Standard deviation29.011492
Coefficient of variation (CV)0.57448499
Kurtosis-1.2
Mean50.5
Median Absolute Deviation (MAD)25
Skewness0
Sum5050
Variance841.66667
MonotonicityStrictly increasing
2023-12-10T22:42:52.026526image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.0%
65 1
 
1.0%
75 1
 
1.0%
74 1
 
1.0%
73 1
 
1.0%
72 1
 
1.0%
71 1
 
1.0%
70 1
 
1.0%
69 1
 
1.0%
68 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
1 1
1.0%
2 1
1.0%
3 1
1.0%
4 1
1.0%
5 1
1.0%
6 1
1.0%
7 1
1.0%
8 1
1.0%
9 1
1.0%
10 1
1.0%
ValueCountFrequency (%)
100 1
1.0%
99 1
1.0%
98 1
1.0%
97 1
1.0%
96 1
1.0%
95 1
1.0%
94 1
1.0%
93 1
1.0%
92 1
1.0%
91 1
1.0%

측정일
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
20201201
100 

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row20201201
2nd row20201201
3rd row20201201
4th row20201201
5th row20201201

Common Values

ValueCountFrequency (%)
20201201 100
100.0%

Length

2023-12-10T22:42:52.331928image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:42:52.462596image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
20201201 100
100.0%

측정시간
Real number (ℝ)

HIGH CORRELATION 

Distinct95
Distinct (%)95.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1209.8
Minimum15
Maximum2345
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T22:42:52.650121image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum15
5-th percentile129.25
Q1626.25
median1237.5
Q31733.75
95-th percentile2230.75
Maximum2345
Range2330
Interquartile range (IQR)1107.5

Descriptive statistics

Standard deviation680.38199
Coefficient of variation (CV)0.56239213
Kurtosis-1.1816878
Mean1209.8
Median Absolute Deviation (MAD)570
Skewness-0.092958644
Sum120980
Variance462919.66
MonotonicityNot monotonic
2023-12-10T22:42:52.971004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1730 2
 
2.0%
1715 2
 
2.0%
1700 2
 
2.0%
1645 2
 
2.0%
1630 2
 
2.0%
15 1
 
1.0%
30 1
 
1.0%
1745 1
 
1.0%
1615 1
 
1.0%
1600 1
 
1.0%
Other values (85) 85
85.0%
ValueCountFrequency (%)
15 1
1.0%
30 1
1.0%
45 1
1.0%
100 1
1.0%
115 1
1.0%
130 1
1.0%
145 1
1.0%
200 1
1.0%
215 1
1.0%
230 1
1.0%
ValueCountFrequency (%)
2345 1
1.0%
2330 1
1.0%
2315 1
1.0%
2300 1
1.0%
2245 1
1.0%
2230 1
1.0%
2215 1
1.0%
2200 1
1.0%
2145 1
1.0%
2130 1
1.0%

지점
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
A-0010-1185E-8
95 
A-0010-3019E-6
 
5

Length

Max length14
Median length14
Mean length14
Min length14

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowA-0010-1185E-8
2nd rowA-0010-1185E-8
3rd rowA-0010-1185E-8
4th rowA-0010-1185E-8
5th rowA-0010-1185E-8

Common Values

ValueCountFrequency (%)
A-0010-1185E-8 95
95.0%
A-0010-3019E-6 5
 
5.0%

Length

2023-12-10T22:42:53.414296image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:42:53.834045image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
a-0010-1185e-8 95
95.0%
a-0010-3019e-6 5
 
5.0%

주소
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
대구 동구 안심3동
95 
충북 청주시 흥덕구 강서1동
 
5

Length

Max length15
Median length10
Mean length10.25
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row대구 동구 안심3동
2nd row대구 동구 안심3동
3rd row대구 동구 안심3동
4th row대구 동구 안심3동
5th row대구 동구 안심3동

Common Values

ValueCountFrequency (%)
대구 동구 안심3동 95
95.0%
충북 청주시 흥덕구 강서1동 5
 
5.0%

Length

2023-12-10T22:42:54.285870image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:42:54.525757image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
대구 95
31.1%
동구 95
31.1%
안심3동 95
31.1%
충북 5
 
1.6%
청주시 5
 
1.6%
흥덕구 5
 
1.6%
강서1동 5
 
1.6%

강수량(mm)
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
0
100 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 100
100.0%

Length

2023-12-10T22:42:54.708926image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:42:54.918047image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 100
100.0%

강우량(mm)
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
0
100 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 100
100.0%

Length

2023-12-10T22:42:55.164543image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T22:42:55.313687image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 100
100.0%

Interactions

2023-12-10T22:42:51.077272image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:42:50.757677image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:42:51.211481image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T22:42:50.916402image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T22:42:55.479545image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기본키측정시간지점주소
기본키1.0000.9780.8160.816
측정시간0.9781.0000.4190.419
지점0.8160.4191.0000.986
주소0.8160.4190.9861.000
2023-12-10T22:42:55.651855image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
주소지점
주소1.0000.894
지점0.8941.000
2023-12-10T22:42:55.774349image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기본키측정시간지점주소
기본키1.0000.9730.6220.622
측정시간0.9731.0000.3070.307
지점0.6220.3071.0000.894
주소0.6220.3070.8941.000

Missing values

2023-12-10T22:42:51.374369image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T22:42:51.562123image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

기본키측정일측정시간지점주소강수량(mm)강우량(mm)
012020120115A-0010-1185E-8대구 동구 안심3동00
122020120130A-0010-1185E-8대구 동구 안심3동00
232020120145A-0010-1185E-8대구 동구 안심3동00
3420201201100A-0010-1185E-8대구 동구 안심3동00
4520201201115A-0010-1185E-8대구 동구 안심3동00
5620201201130A-0010-1185E-8대구 동구 안심3동00
6720201201145A-0010-1185E-8대구 동구 안심3동00
7820201201200A-0010-1185E-8대구 동구 안심3동00
8920201201215A-0010-1185E-8대구 동구 안심3동00
91020201201230A-0010-1185E-8대구 동구 안심3동00
기본키측정일측정시간지점주소강수량(mm)강우량(mm)
9091202012012245A-0010-1185E-8대구 동구 안심3동00
9192202012012300A-0010-1185E-8대구 동구 안심3동00
9293202012012315A-0010-1185E-8대구 동구 안심3동00
9394202012012330A-0010-1185E-8대구 동구 안심3동00
9495202012012345A-0010-1185E-8대구 동구 안심3동00
9596202012011630A-0010-3019E-6충북 청주시 흥덕구 강서1동00
9697202012011645A-0010-3019E-6충북 청주시 흥덕구 강서1동00
9798202012011700A-0010-3019E-6충북 청주시 흥덕구 강서1동00
9899202012011715A-0010-3019E-6충북 청주시 흥덕구 강서1동00
99100202012011730A-0010-3019E-6충북 청주시 흥덕구 강서1동00