Overview

Dataset statistics

Number of variables2
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory1.9 KiB
Average record size in memory19.3 B

Variable types

Numeric1
Categorical1

Alerts

통합대기환경지수값 is highly imbalanced (80.6%)Imbalance
인덱스 ID has unique valuesUnique

Reproduction

Analysis started2023-12-10 10:21:51.939566
Analysis finished2023-12-10 10:21:52.410596
Duration0.47 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

인덱스 ID
Real number (ℝ)

UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean124.66
Minimum1
Maximum240
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T19:21:52.545838image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.95
Q164.75
median128.5
Q3186
95-th percentile231.25
Maximum240
Range239
Interquartile range (IQR)121.25

Descriptive statistics

Standard deviation71.46352
Coefficient of variation (CV)0.57326745
Kurtosis-1.0539929
Mean124.66
Median Absolute Deviation (MAD)61
Skewness-0.19615308
Sum12466
Variance5107.0347
MonotonicityStrictly increasing
2023-12-10T19:21:52.810422image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.0%
163 1
 
1.0%
185 1
 
1.0%
184 1
 
1.0%
183 1
 
1.0%
182 1
 
1.0%
175 1
 
1.0%
174 1
 
1.0%
169 1
 
1.0%
168 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
1 1
1.0%
2 1
1.0%
3 1
1.0%
4 1
1.0%
5 1
1.0%
6 1
1.0%
7 1
1.0%
8 1
1.0%
9 1
1.0%
12 1
1.0%
ValueCountFrequency (%)
240 1
1.0%
239 1
1.0%
238 1
1.0%
237 1
1.0%
236 1
1.0%
231 1
1.0%
230 1
1.0%
229 1
1.0%
228 1
1.0%
215 1
1.0%

통합대기환경지수값
Categorical

IMBALANCE 

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
0
97 
1
 
3

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 97
97.0%
1 3
 
3.0%

Length

2023-12-10T19:21:53.055921image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T19:21:53.586412image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 97
97.0%
1 3
 
3.0%

Interactions

2023-12-10T19:21:52.045407image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T19:21:53.692587image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
인덱스 ID통합대기환경지수값
인덱스 ID1.0000.216
통합대기환경지수값0.2161.000
2023-12-10T19:21:53.830436image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
인덱스 ID통합대기환경지수값
인덱스 ID1.0000.156
통합대기환경지수값0.1561.000

Missing values

2023-12-10T19:21:52.234133image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T19:21:52.365998image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

인덱스 ID통합대기환경지수값
010
120
230
340
450
560
670
780
890
9120
인덱스 ID통합대기환경지수값
902151
912280
922290
932300
942310
952360
962370
972380
982390
992400