Overview

Dataset statistics

Number of variables7
Number of observations48
Missing cells0
Missing cells (%)0.0%
Duplicate rows24
Duplicate rows (%)50.0%
Total size in memory3.1 KiB
Average record size in memory65.8 B

Variable types

Categorical2
Numeric5

Alerts

배출년도 has constant value ""Constant
배출월 has constant value ""Constant
Dataset has 24 (50.0%) duplicate rowsDuplicates
배출시 is highly overall correlated with 배출량(g) and 3 other fieldsHigh correlation
배출량(g) is highly overall correlated with 배출시 and 3 other fieldsHigh correlation
배출량비율(%) is highly overall correlated with 배출시 and 3 other fieldsHigh correlation
배출횟수 is highly overall correlated with 배출시 and 3 other fieldsHigh correlation
배출횟수비율(%) is highly overall correlated with 배출시 and 3 other fieldsHigh correlation
배출시 has 2 (4.2%) zerosZeros

Reproduction

Analysis started2023-12-10 11:59:55.859504
Analysis finished2023-12-10 11:59:59.402356
Duration3.54 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

배출년도
Categorical

CONSTANT 

Distinct1
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Memory size516.0 B
2021
48 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2021
2nd row2021
3rd row2021
4th row2021
5th row2021

Common Values

ValueCountFrequency (%)
2021 48
100.0%

Length

2023-12-10T20:59:59.845216image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T20:59:59.996991image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2021 48
100.0%

배출월
Categorical

CONSTANT 

Distinct1
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Memory size516.0 B
3
48 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row3
2nd row3
3rd row3
4th row3
5th row3

Common Values

ValueCountFrequency (%)
3 48
100.0%

Length

2023-12-10T21:00:00.165383image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T21:00:00.299899image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
3 48
100.0%

배출시
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct24
Distinct (%)50.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11.5
Minimum0
Maximum23
Zeros2
Zeros (%)4.2%
Negative0
Negative (%)0.0%
Memory size564.0 B
2023-12-10T21:00:00.443069image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile1
Q15.75
median11.5
Q317.25
95-th percentile22
Maximum23
Range23
Interquartile range (IQR)11.5

Descriptive statistics

Standard deviation6.9954392
Coefficient of variation (CV)0.60829906
Kurtosis-1.2034843
Mean11.5
Median Absolute Deviation (MAD)6
Skewness0
Sum552
Variance48.93617
MonotonicityNot monotonic
2023-12-10T21:00:00.660330image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=24)
ValueCountFrequency (%)
0 2
 
4.2%
13 2
 
4.2%
23 2
 
4.2%
22 2
 
4.2%
21 2
 
4.2%
20 2
 
4.2%
19 2
 
4.2%
18 2
 
4.2%
17 2
 
4.2%
16 2
 
4.2%
Other values (14) 28
58.3%
ValueCountFrequency (%)
0 2
4.2%
1 2
4.2%
2 2
4.2%
3 2
4.2%
4 2
4.2%
5 2
4.2%
6 2
4.2%
7 2
4.2%
8 2
4.2%
9 2
4.2%
ValueCountFrequency (%)
23 2
4.2%
22 2
4.2%
21 2
4.2%
20 2
4.2%
19 2
4.2%
18 2
4.2%
17 2
4.2%
16 2
4.2%
15 2
4.2%
14 2
4.2%

배출량(g)
Real number (ℝ)

HIGH CORRELATION 

Distinct24
Distinct (%)50.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.9529074 × 109
Minimum1.3218857 × 108
Maximum5.3555082 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size564.0 B
2023-12-10T21:00:00.853160image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.3218857 × 108
5-th percentile1.8262364 × 108
Q11.320014 × 109
median3.7367012 × 109
Q34.0360627 × 109
95-th percentile5.2946061 × 109
Maximum5.3555082 × 109
Range5.2233196 × 109
Interquartile range (IQR)2.7160487 × 109

Descriptive statistics

Standard deviation1.6971296 × 109
Coefficient of variation (CV)0.57473176
Kurtosis-1.1157877
Mean2.9529074 × 109
Median Absolute Deviation (MAD)7.1055512 × 108
Skewness-0.5683647
Sum1.4173955 × 1011
Variance2.880249 × 1018
MonotonicityNot monotonic
2023-12-10T21:00:01.056767image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=24)
ValueCountFrequency (%)
753318786 2
 
4.2%
4142671061 2
 
4.2%
1504999428 2
 
4.2%
2670238796 2
 
4.2%
4478299011 2
 
4.2%
5294606093 2
 
4.2%
5355508189 2
 
4.2%
4416213554 2
 
4.2%
3938009658 2
 
4.2%
3874779755 2
 
4.2%
Other values (14) 28
58.3%
ValueCountFrequency (%)
132188566 2
4.2%
182623639 2
4.2%
263324563 2
4.2%
318486765 2
4.2%
753318786 2
4.2%
765057695 2
4.2%
1504999428 2
4.2%
2210280644 2
4.2%
2670238796 2
4.2%
3402020652 2
4.2%
ValueCountFrequency (%)
5355508189 2
4.2%
5294606093 2
4.2%
4478299011 2
4.2%
4416213554 2
4.2%
4273749065 2
4.2%
4142671061 2
4.2%
4000526626 2
4.2%
3949516190 2
4.2%
3938009658 2
4.2%
3922645741 2
4.2%

배출량비율(%)
Real number (ℝ)

HIGH CORRELATION 

Distinct24
Distinct (%)50.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.1675
Minimum0.19
Maximum7.56
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size564.0 B
2023-12-10T21:00:01.240119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.19
5-th percentile0.26
Q11.86
median5.275
Q35.6925
95-th percentile7.47
Maximum7.56
Range7.37
Interquartile range (IQR)3.8325

Descriptive statistics

Standard deviation2.3950716
Coefficient of variation (CV)0.57470225
Kurtosis-1.1166234
Mean4.1675
Median Absolute Deviation (MAD)1
Skewness-0.56834609
Sum200.04
Variance5.7363681
MonotonicityNot monotonic
2023-12-10T21:00:01.424503image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=24)
ValueCountFrequency (%)
1.06 2
 
4.2%
5.85 2
 
4.2%
2.12 2
 
4.2%
3.77 2
 
4.2%
6.32 2
 
4.2%
7.47 2
 
4.2%
7.56 2
 
4.2%
6.23 2
 
4.2%
5.56 2
 
4.2%
5.47 2
 
4.2%
Other values (14) 28
58.3%
ValueCountFrequency (%)
0.19 2
4.2%
0.26 2
4.2%
0.37 2
4.2%
0.45 2
4.2%
1.06 2
4.2%
1.08 2
4.2%
2.12 2
4.2%
3.12 2
4.2%
3.77 2
4.2%
4.8 2
4.2%
ValueCountFrequency (%)
7.56 2
4.2%
7.47 2
4.2%
6.32 2
4.2%
6.23 2
4.2%
6.03 2
4.2%
5.85 2
4.2%
5.64 2
4.2%
5.57 2
4.2%
5.56 2
4.2%
5.54 2
4.2%

배출횟수
Real number (ℝ)

HIGH CORRELATION 

Distinct24
Distinct (%)50.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1662028.2
Minimum48623
Maximum3228136
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size564.0 B
2023-12-10T21:00:01.625685image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum48623
5-th percentile67067
Q1673810
median2190656.5
Q32304603.5
95-th percentile3163046
Maximum3228136
Range3179513
Interquartile range (IQR)1630793.5

Descriptive statistics

Standard deviation1029858.4
Coefficient of variation (CV)0.61963957
Kurtosis-1.2338126
Mean1662028.2
Median Absolute Deviation (MAD)489435
Skewness-0.42262221
Sum79777352
Variance1.0606084 × 1012
MonotonicityNot monotonic
2023-12-10T21:00:01.826376image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=24)
ValueCountFrequency (%)
389896 2
 
4.2%
2307629 2
 
4.2%
827852 2
 
4.2%
1519341 2
 
4.2%
2602865 2
 
4.2%
3163046 2
 
4.2%
3228136 2
 
4.2%
2604389 2
 
4.2%
2265857 2
 
4.2%
2243024 2
 
4.2%
Other values (14) 28
58.3%
ValueCountFrequency (%)
48623 2
4.2%
67067 2
4.2%
102849 2
4.2%
150238 2
4.2%
283653 2
4.2%
389896 2
4.2%
768448 2
4.2%
827852 2
4.2%
1519341 2
4.2%
1625519 2
4.2%
ValueCountFrequency (%)
3228136 2
4.2%
3163046 2
4.2%
2604389 2
4.2%
2602865 2
4.2%
2449802 2
4.2%
2307629 2
4.2%
2303595 2
4.2%
2265857 2
4.2%
2261127 2
4.2%
2243024 2
4.2%

배출횟수비율(%)
Real number (ℝ)

HIGH CORRELATION 

Distinct22
Distinct (%)45.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.1679167
Minimum0.12
Maximum8.09
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size564.0 B
2023-12-10T21:00:02.014896image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.12
5-th percentile0.17
Q11.6925
median5.49
Q35.7825
95-th percentile7.93
Maximum8.09
Range7.97
Interquartile range (IQR)4.09

Descriptive statistics

Standard deviation2.5813092
Coefficient of variation (CV)0.61932841
Kurtosis-1.2334495
Mean4.1679167
Median Absolute Deviation (MAD)1.225
Skewness-0.42335331
Sum200.06
Variance6.6631573
MonotonicityNot monotonic
2023-12-10T21:00:02.226229image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=22)
ValueCountFrequency (%)
5.49 4
 
8.3%
6.53 4
 
8.3%
0.98 2
 
4.2%
2.08 2
 
4.2%
3.81 2
 
4.2%
7.93 2
 
4.2%
8.09 2
 
4.2%
5.68 2
 
4.2%
5.62 2
 
4.2%
5.54 2
 
4.2%
Other values (12) 24
50.0%
ValueCountFrequency (%)
0.12 2
4.2%
0.17 2
4.2%
0.26 2
4.2%
0.38 2
4.2%
0.71 2
4.2%
0.98 2
4.2%
1.93 2
4.2%
2.08 2
4.2%
3.81 2
4.2%
4.08 2
4.2%
ValueCountFrequency (%)
8.09 2
4.2%
7.93 2
4.2%
6.53 4
8.3%
6.14 2
4.2%
5.79 2
4.2%
5.78 2
4.2%
5.68 2
4.2%
5.67 2
4.2%
5.62 2
4.2%
5.54 2
4.2%

Interactions

2023-12-10T20:59:58.433140image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:59:56.033449image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:59:56.629457image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:59:57.175237image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:59:57.785383image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:59:58.549011image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:59:56.124882image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:59:56.726328image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:59:57.289119image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:59:57.903149image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:59:58.668545image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:59:56.230380image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:59:56.829384image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:59:57.405637image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:59:58.039357image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:59:58.778190image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:59:56.403392image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:59:56.933154image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:59:57.539445image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:59:58.159484image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:59:58.908934image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:59:56.518139image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:59:57.040703image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:59:57.659017image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T20:59:58.292564image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T21:00:02.388472image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
배출시배출량(g)배출량비율(%)배출횟수배출횟수비율(%)
배출시1.0000.8520.8590.8280.837
배출량(g)0.8521.0001.0000.9370.937
배출량비율(%)0.8591.0001.0000.9370.939
배출횟수0.8280.9370.9371.0001.000
배출횟수비율(%)0.8370.9370.9391.0001.000
2023-12-10T21:00:02.545395image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
배출시배출량(g)배출량비율(%)배출횟수배출횟수비율(%)
배출시1.0000.6580.6580.6640.667
배출량(g)0.6581.0001.0000.9840.987
배출량비율(%)0.6581.0001.0000.9840.987
배출횟수0.6640.9840.9841.0001.000
배출횟수비율(%)0.6670.9870.9871.0001.000

Missing values

2023-12-10T20:59:59.123780image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T20:59:59.303960image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

배출년도배출월배출시배출량(g)배출량비율(%)배출횟수배출횟수비율(%)
02021307533187861.063898960.98
12021313184867650.451502380.38
22021321826236390.26670670.17
32021331321885660.19486230.12
42021342633245630.371028490.26
52021357650576951.082836530.71
620213622102806443.127684481.93
720213734020206524.816255194.08
820213839226457415.5422611275.67
920213942737490656.0324498026.14
배출년도배출월배출시배출량(g)배출량비율(%)배출횟수배출횟수비율(%)
38202131439495161905.5721902835.49
39202131538030849715.3722095365.54
40202131638747797555.4722430245.62
41202131739380096585.5622658575.68
42202131844162135546.2326043896.53
43202131953555081897.5632281368.09
44202132052946060937.4731630467.93
45202132144782990116.3226028656.53
46202132226702387963.7715193413.81
47202132315049994282.128278522.08

Duplicate rows

Most frequently occurring

배출년도배출월배출시배출량(g)배출량비율(%)배출횟수배출횟수비율(%)# duplicates
02021307533187861.063898960.982
12021313184867650.451502380.382
22021321826236390.26670670.172
32021331321885660.19486230.122
42021342633245630.371028490.262
52021357650576951.082836530.712
620213622102806443.127684481.932
720213734020206524.816255194.082
820213839226457415.5422611275.672
920213942737490656.0324498026.142