Overview

Dataset statistics

Number of variables: 12
Number of observations: 99
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 9.9 KiB
Average record size in memory: 102.3 B

Variable types

Categorical: 10
Numeric: 2

Dataset

Description: Sample
Author: (주)제로투원파트너스
URL: https://www.bigdata-telecom.kr/invoke/SOKBP2603/?goodsCode=ZTO010BSICGIDIAPER

Alerts

Alert | Type
MT has constant value "" | Constant
201802 has constant value "" | Constant
기저귀 has constant value "" | Constant
전체 has constant value "" | Constant
전체.1 has constant value "" | Constant
깨끗한나라 보솜이 has constant value "" | Constant
20 is highly overall correlated with 1 | High correlation
1 is highly overall correlated with 20 | High correlation
108 is highly overall correlated with 7 | High correlation
7 is highly overall correlated with 108 | High correlation
전체.2 is highly overall correlated with 전체.3 | High correlation
전체.3 is highly overall correlated with 전체.2 | High correlation
1 is highly imbalanced (80.4%) | Imbalance
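
The 80.4% imbalance figure for variable 1 can be reproduced from its value counts (96 and 3). The sketch below assumes the score is one minus the Shannon entropy of the class distribution divided by its maximum, log2 of the number of classes. The function name and the handling of single-class input are illustrative choices, not taken from the report.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the entropy of the class counts.

    0.0 means perfectly balanced classes; values near 1.0 mean one class
    dominates. Returning 0.0 for fewer than two classes is an assumption.
    """
    total = sum(counts)
    n_classes = len(counts)
    if n_classes < 2:
        return 0.0
    entropy = -sum(
        (c / total) * math.log2(c / total) for c in counts if c > 0
    )
    return 1.0 - entropy / math.log2(n_classes)


# Value counts of variable "1" as listed in this report: 96 ones, 3 twos.
score = imbalance_score([96, 3])
print(f"{score:.1%}")  # 80.4%
```

Under that assumption the result matches the alert, which suggests the flag reflects the 97.0% / 3.0% split rather than any data-quality problem.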

Reproduction

Analysis started: 2023-12-10 06:41:34.718618
Analysis finished: 2023-12-10 06:41:36.628209
Duration: 1.91 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
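
As a quick consistency check, the reported duration follows from the two timestamps. This is a minimal sketch using only the standard library; the variable names are illustrative.

```python
from datetime import datetime

# Timestamps copied from the Reproduction section of this report.
started = datetime.fromisoformat("2023-12-10 06:41:34.718618")
finished = datetime.fromisoformat("2023-12-10 06:41:36.628209")

duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} s")  # 1.91 s
```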

Variables

MT
Categorical

CONSTANT

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 924.0 B

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: MT
2nd row: MT
3rd row: MT
4th row: MT
5th row: MT

Common Values

Value | Count | Frequency (%)
MT | 99 | 100.0%

Length

Figure: histogram of lengths of the category.

Common Values (Plot)

Figure: plot of the common values.

Words

Value | Count | Frequency (%)
mt | 99 | 100.0%

201802
Categorical

CONSTANT

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 924.0 B

Length

Max length: 6
Median length: 6
Mean length: 6
Min length: 6

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 201802
2nd row: 201802
3rd row: 201802
4th row: 201802
5th row: 201802

Common Values

Value | Count | Frequency (%)
201802 | 99 | 100.0%

Length

Figure: histogram of lengths of the category.

Common Values (Plot)

Figure: plot of the common values.

Words

Value | Count | Frequency (%)
201802 | 99 | 100.0%

108
Real number (ℝ)

HIGH CORRELATION

Distinct: 74
Distinct (%): 74.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 101.70707
Minimum: 11
Maximum: 288
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1023.0 B

Quantile statistics

Minimum: 11
5th percentile: 30
Q1: 70.5
Median: 93
Q3: 139
95th percentile: 169.1
Maximum: 288
Range: 277
Interquartile range (IQR): 68.5

Descriptive statistics

Standard deviation: 49.619419
Coefficient of variation (CV): 0.48786598
Kurtosis: 1.2349653
Mean: 101.70707
Median Absolute Deviation (MAD): 40
Skewness: 0.72929553
Sum: 10069
Variance: 2462.0868
Monotonicity: Not monotonic
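
Several of the figures above are derived from others in the same listing, so they can be cross-checked without the raw data. The sketch below recomputes the mean, range, IQR, coefficient of variation, and variance for variable 108 from the reported sum, count, extremes, quartiles, and standard deviation. The variable names are illustrative.

```python
# Inputs copied from the statistics reported for variable "108".
n_observations = 99
total = 10069
minimum, maximum = 11, 288
q1, q3 = 70.5, 139
std_dev = 49.619419

mean = total / n_observations   # reported as 101.70707
value_range = maximum - minimum  # reported as 277
iqr = q3 - q1                    # reported as 68.5
cv = std_dev / mean              # reported as 0.48786598
variance = std_dev ** 2          # reported as 2462.0868

print(round(mean, 5), value_range, iqr)
print(round(cv, 6), round(variance, 3))
```

The recomputed values agree with the report up to the rounding of the printed standard deviation.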
Figure: histogram with fixed size bins (bins=50).

Common values

Value | Count | Frequency (%)
53 | 4 | 4.0%
157 | 4 | 4.0%
139 | 3 | 3.0%
143 | 3 | 3.0%
86 | 3 | 3.0%
72 | 2 | 2.0%
79 | 2 | 2.0%
33 | 2 | 2.0%
142 | 2 | 2.0%
78 | 2 | 2.0%
Other values (64) | 72 | 72.7%

Minimum 10 values

Value | Count | Frequency (%)
11 | 1 | 1.0%
13 | 1 | 1.0%
24 | 1 | 1.0%
26 | 1 | 1.0%
30 | 2 | 2.0%
33 | 2 | 2.0%
36 | 1 | 1.0%
43 | 1 | 1.0%
44 | 1 | 1.0%
45 | 1 | 1.0%

Maximum 10 values

Value | Count | Frequency (%)
288 | 1 | 1.0%
245 | 1 | 1.0%
202 | 1 | 1.0%
200 | 1 | 1.0%
170 | 1 | 1.0%
169 | 1 | 1.0%
168 | 1 | 1.0%
165 | 2 | 2.0%
157 | 4 | 4.0%
152 | 1 | 1.0%

7
Real number (ℝ)

HIGH CORRELATION

Distinct: 26
Distinct (%): 26.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 11.474747
Minimum: 1
Maximum: 100
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1023.0 B

Quantile statistics

Minimum: 1
5th percentile: 2.9
Q1: 5
Median: 7
Q3: 11
95th percentile: 34.1
Maximum: 100
Range: 99
Interquartile range (IQR): 6

Descriptive statistics

Standard deviation: 15.395695
Coefficient of variation (CV): 1.3417023
Kurtosis: 16.156964
Mean: 11.474747
Median Absolute Deviation (MAD): 3
Skewness: 3.7830826
Sum: 1136
Variance: 237.02742
Monotonicity: Not monotonic
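
The high skewness (3.78) and kurtosis (16.16) point to a long right tail. One common way to make that concrete is Tukey's 1.5 × IQR rule, which is not part of the report itself and is used here only as an illustration. The sketch applies the rule to the reported quartiles and to the ten largest values listed for this variable in the report.

```python
# Quartiles copied from the quantile statistics of variable "7".
q1, q3 = 5, 11
iqr = q3 - q1

# Tukey fences: values outside [lower, upper] are flagged as potential outliers.
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr

# The ten largest values the report lists for this variable (each occurs once).
listed_maxima = [100, 80, 65, 63, 35, 34, 28, 27, 26, 24]
flagged = [value for value in listed_maxima if value > upper_fence]

print(lower_fence, upper_fence)  # -4.0 20.0
print(len(flagged))              # 10
```

All ten listed maxima fall above the upper fence, so at least about a tenth of the 99 observations sit in the tail; values between 20 and 24 may exist as well but are not visible in the listing.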
Figure: histogram with fixed size bins (bins=26).

Common values

Value | Count | Frequency (%)
5 | 13 | 13.1%
3 | 12 | 12.1%
6 | 10 | 10.1%
7 | 8 | 8.1%
9 | 6 | 6.1%
8 | 6 | 6.1%
4 | 6 | 6.1%
11 | 5 | 5.1%
10 | 5 | 5.1%
14 | 4 | 4.0%
Other values (16) | 24 | 24.2%

Minimum 10 values

Value | Count | Frequency (%)
1 | 4 | 4.0%
2 | 1 | 1.0%
3 | 12 | 12.1%
4 | 6 | 6.1%
5 | 13 | 13.1%
6 | 10 | 10.1%
7 | 8 | 8.1%
8 | 6 | 6.1%
9 | 6 | 6.1%
10 | 5 | 5.1%

Maximum 10 values

Value | Count | Frequency (%)
100 | 1 | 1.0%
80 | 1 | 1.0%
65 | 1 | 1.0%
63 | 1 | 1.0%
35 | 1 | 1.0%
34 | 1 | 1.0%
28 | 1 | 1.0%
27 | 1 | 1.0%
26 | 1 | 1.0%
24 | 1 | 1.0%

기저귀
Categorical

CONSTANT

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 924.0 B

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 기저귀
2nd row: 기저귀
3rd row: 기저귀
4th row: 기저귀
5th row: 기저귀

Common Values

Value | Count | Frequency (%)
기저귀 | 99 | 100.0%

Length

Figure: histogram of lengths of the category.

Common Values (Plot)

Figure: plot of the common values.

Words

Value | Count | Frequency (%)
기저귀 | 99 | 100.0%

전체
Categorical

CONSTANT

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 924.0 B

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 전체
2nd row: 전체
3rd row: 전체
4th row: 전체
5th row: 전체

Common Values

Value | Count | Frequency (%)
전체 | 99 | 100.0%

Length

Figure: histogram of lengths of the category.

Common Values (Plot)

Figure: plot of the common values.

Words

Value | Count | Frequency (%)
전체 | 99 | 100.0%

전체.1
Categorical

CONSTANT

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 924.0 B

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 전체
2nd row: 전체
3rd row: 전체
4th row: 전체
5th row: 전체

Common Values

Value | Count | Frequency (%)
전체 | 99 | 100.0%

Length

Figure: histogram of lengths of the category.

Common Values (Plot)

Figure: plot of the common values.

Words

Value | Count | Frequency (%)
전체 | 99 | 100.0%

1
Categorical

HIGH CORRELATION, IMBALANCE

Distinct: 2
Distinct (%): 2.0%
Missing: 0
Missing (%): 0.0%
Memory size: 924.0 B

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1
2nd row: 1
3rd row: 1
4th row: 1
5th row: 1

Common Values

Value | Count | Frequency (%)
1 | 96 | 97.0%
2 | 3 | 3.0%

Length

Figure: histogram of lengths of the category.

Common Values (Plot)

Figure: plot of the common values.

Words

Value | Count | Frequency (%)
1 | 96 | 97.0%
2 | 3 | 3.0%

20
Categorical

HIGH CORRELATION

Distinct: 5
Distinct (%): 5.1%
Missing: 0
Missing (%): 0.0%
Memory size: 924.0 B

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 20
2nd row: 20
3rd row: 20
4th row: 20
5th row: 20

Common Values

Value | Count | Frequency (%)
30 | 46 | 46.5%
40 | 23 | 23.2%
20 | 18 | 18.2%
50 | 9 | 9.1%
70 | 3 | 3.0%

Length

Figure: histogram of lengths of the category.

Common Values (Plot)

Figure: plot of the common values.

Words

Value | Count | Frequency (%)
30 | 46 | 46.5%
40 | 23 | 23.2%
20 | 18 | 18.2%
50 | 9 | 9.1%
70 | 3 | 3.0%

전체.2
Categorical

HIGH CORRELATION

Distinct: 17
Distinct (%): 17.2%
Missing: 0
Missing (%): 0.0%
Memory size: 924.0 B

Length

Max length: 7
Median length: 5
Mean length: 4.2525253
Min length: 2

Unique

Unique: 2
Unique (%): 2.0%

Sample

1st row: 광주광역시
2nd row: 울산광역시
3rd row: 강원도
4th row: 충청남도
5th row: 경상남도

Common Values

Value | Count | Frequency (%)
충청남도 | 15 | 15.2%
경기도 | 15 | 15.2%
서울특별시 | 9 | 9.1%
울산광역시 | 9 | 9.1%
인천광역시 | 8 | 8.1%
광주광역시 | 7 | 7.1%
강원도 | 7 | 7.1%
부산광역시 | 5 | 5.1%
경상남도 | 5 | 5.1%
전라북도 | 4 | 4.0%
Other values (7) | 15 | 15.2%

Length

Figure: histogram of lengths of the category.

Words

Value | Count | Frequency (%)
충청남도 | 15 | 15.2%
경기도 | 15 | 15.2%
서울특별시 | 9 | 9.1%
울산광역시 | 9 | 9.1%
인천광역시 | 8 | 8.1%
광주광역시 | 7 | 7.1%
강원도 | 7 | 7.1%
경상남도 | 5 | 5.1%
부산광역시 | 5 | 5.1%
전라북도 | 4 | 4.0%
Other values (7) | 15 | 15.2%

전체.3
Categorical

HIGH CORRELATION

Distinct: 41
Distinct (%): 41.4%
Missing: 0
Missing (%): 0.0%
Memory size: 924.0 B

Length

Max length: 4
Median length: 3
Mean length: 2.7070707
Min length: 2

Unique

Unique: 14
Unique (%): 14.1%

Sample

1st row: 서구
2nd row: 전체
3rd row: 춘천시
4th row: 천안시
5th row: 전체

Common Values

Value | Count | Frequency (%)
전체 | 19 | 19.2%
서구 | 7 | 7.1%
북구 | 4 | 4.0%
부산진구 | 3 | 3.0%
구리시 | 3 | 3.0%
광산구 | 3 | 3.0%
홍성군 | 3 | 3.0%
도봉구 | 3 | 3.0%
천안시 | 3 | 3.0%
춘천시 | 3 | 3.0%
Other values (31) | 48 | 48.5%

Length

Figure: histogram of lengths of the category.

Words

Value | Count | Frequency (%)
전체 | 19 | 19.2%
서구 | 7 | 7.1%
북구 | 4 | 4.0%
홍성군 | 3 | 3.0%
천안시 | 3 | 3.0%
도봉구 | 3 | 3.0%
춘천시 | 3 | 3.0%
광산구 | 3 | 3.0%
구리시 | 3 | 3.0%
부산진구 | 3 | 3.0%
Other values (31) | 48 | 48.5%

깨끗한나라 보솜이
Categorical

CONSTANT

Distinct: 1
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 924.0 B

Length

Max length: 9
Median length: 9
Mean length: 9
Min length: 9

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 깨끗한나라 보솜이
2nd row: 깨끗한나라 보솜이
3rd row: 깨끗한나라 보솜이
4th row: 깨끗한나라 보솜이
5th row: 깨끗한나라 보솜이

Common Values

Value | Count | Frequency (%)
깨끗한나라 보솜이 | 99 | 100.0%

Length

Figure: histogram of lengths of the category.

Common Values (Plot)

Figure: plot of the common values.

Words

Value | Count | Frequency (%)
깨끗한나라 | 99 | 50.0%
보솜이 | 99 | 50.0%

Interactions

Figures: four interaction plots (images not preserved in this text).

Correlations

Matrix 1 (heatmap not preserved)

Variable | 108 | 7 | 1 | 20 | 전체.2 | 전체.3
108 | 1.000 | 0.398 | 0.000 | 0.000 | 0.349 | 0.661
7 | 0.398 | 1.000 | 0.384 | 0.462 | 0.661 | 0.706
1 | 0.000 | 0.384 | 1.000 | 1.000 | 0.323 | 0.000
20 | 0.000 | 0.462 | 1.000 | 1.000 | 0.000 | 0.000
전체.2 | 0.349 | 0.661 | 0.323 | 0.000 | 1.000 | 0.975
전체.3 | 0.661 | 0.706 | 0.000 | 0.000 | 0.975 | 1.000

Matrix 2 (heatmap not preserved)

Variable | 전체.3 | 전체.2 | 20 | 1
전체.3 | 1.000 | 0.636 | 0.000 | 0.000
전체.2 | 0.636 | 1.000 | 0.000 | 0.264
20 | 0.000 | 0.000 | 1.000 | 0.984
1 | 0.000 | 0.264 | 0.984 | 1.000

Matrix 3 (heatmap not preserved)

Variable | 108 | 7 | 1 | 20 | 전체.2 | 전체.3
108 | 1.000 | 0.582 | 0.000 | 0.000 | 0.134 | 0.238
7 | 0.582 | 1.000 | 0.400 | 0.314 | 0.350 | 0.291
1 | 0.000 | 0.400 | 1.000 | 0.984 | 0.264 | 0.000
20 | 0.000 | 0.314 | 0.984 | 1.000 | 0.000 | 0.000
전체.2 | 0.134 | 0.350 | 0.264 | 0.000 | 1.000 | 0.636
전체.3 | 0.238 | 0.291 | 0.000 | 0.000 | 0.636 | 1.000
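
The three "High correlation" pairs in the Alerts section can be recovered from the last of the three correlation matrices above by listing every off-diagonal pair at or above a cutoff. The sketch below assumes a cutoff of 0.5; that threshold is an assumption chosen because it reproduces the alerts, and it should be checked against the configuration used for this report. The function name is illustrative.

```python
# Last correlation matrix in this report, values copied as printed.
labels = ["108", "7", "1", "20", "전체.2", "전체.3"]
matrix = [
    [1.000, 0.582, 0.000, 0.000, 0.134, 0.238],
    [0.582, 1.000, 0.400, 0.314, 0.350, 0.291],
    [0.000, 0.400, 1.000, 0.984, 0.264, 0.000],
    [0.000, 0.314, 0.984, 1.000, 0.000, 0.000],
    [0.134, 0.350, 0.264, 0.000, 1.000, 0.636],
    [0.238, 0.291, 0.000, 0.000, 0.636, 1.000],
]


def high_correlation_pairs(labels, matrix, threshold=0.5):
    """List each unordered pair whose absolute correlation meets the threshold."""
    pairs = []
    for i in range(len(labels)):
        for j in range(i + 1, len(labels)):
            if abs(matrix[i][j]) >= threshold:
                pairs.append((labels[i], labels[j], matrix[i][j]))
    return pairs


pairs = high_correlation_pairs(labels, matrix)
for left, right, value in pairs:
    print(f"{left} <-> {right}: {value:.3f}")
```

With the assumed cutoff, the pairs found are 108 and 7, 1 and 20, and 전체.2 and 전체.3, the same pairs the alerts report in both directions.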

Missing values

Figure: a simple visualization of nullity by column.
Figure: the nullity matrix, a data-dense display that lets you quickly pick out patterns in data completion.

Sample

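First rows

Row | MT | 201802 | 108 | 7 | 기저귀 | 전체 | 전체.1 | 1 | 20 | 전체.2 | 전체.3 | 깨끗한나라 보솜이
0 | MT | 201802 | 36 | 3 | 기저귀 | 전체 | 전체 | 1 | 20 | 광주광역시 | 서구 | 깨끗한나라 보솜이
1 | MT | 201802 | 139 | 63 | 기저귀 | 전체 | 전체 | 1 | 20 | 울산광역시 | 전체 | 깨끗한나라 보솜이
2 | MT | 201802 | 104 | 12 | 기저귀 | 전체 | 전체 | 1 | 20 | 강원도 | 춘천시 | 깨끗한나라 보솜이
3 | MT | 201802 | 75 | 7 | 기저귀 | 전체 | 전체 | 1 | 20 | 충청남도 | 천안시 | 깨끗한나라 보솜이
4 | MT | 201802 | 46 | 4 | 기저귀 | 전체 | 전체 | 1 | 20 | 경상남도 | 전체 | 깨끗한나라 보솜이
5 | MT | 201802 | 1126* | * | 기저귀 | 전체 | 전체 | 1 | 30 | 전체 | 전체 | 깨끗한나라 보솜이
6 | MT | 201802 | 49 | 4 | 기저귀 | 전체 | 전체 | 1 | 30 | 서울특별시 | 도봉구 | 깨끗한나라 보솜이
7 | MT | 201802 | 137 | 13 | 기저귀 | 전체 | 전체 | 1 | 30 | 부산광역시 | 부산진구 | 깨끗한나라 보솜이
8 | MT | 201802 | 67 | 3 | 기저귀 | 전체 | 전체 | 1 | 30 | 부산광역시 | 기장군 | 깨끗한나라 보솜이
9 | MT | 201802 | 120 | 6 | 기저귀 | 전체 | 전체 | 1 | 30 | 대구광역시 | 북구 | 깨끗한나라 보솜이

* Row 5: the values of columns 108 and 7 appear fused as "1126" in the source; the split is either 112 and 6 or 11 and 26.

Last rows

Row | MT | 201802 | 108 | 7 | 기저귀 | 전체 | 전체.1 | 1 | 20 | 전체.2 | 전체.3 | 깨끗한나라 보솜이
89 | MT | 201802 | 102 | 4 | 기저귀 | 전체 | 전체 | 1 | 30 | 경기도 | 용인시 | 깨끗한나라 보솜이
90 | MT | 201802 | 79 | 5 | 기저귀 | 전체 | 전체 | 1 | 30 | 경기도 | 안성시 | 깨끗한나라 보솜이
91 | MT | 201802 | 143 | 10 | 기저귀 | 전체 | 전체 | 1 | 30 | 강원도 | 전체 | 깨끗한나라 보솜이
92 | MT | 201802 | 157 | 11 | 기저귀 | 전체 | 전체 | 1 | 30 | 충청북도 | 충주시 | 깨끗한나라 보솜이
93 | MT | 201802 | 148 | 9 | 기저귀 | 전체 | 전체 | 1 | 30 | 충청남도 | 전체 | 깨끗한나라 보솜이
94 | MT | 201802 | 143 | 6 | 기저귀 | 전체 | 전체 | 1 | 30 | 충청남도 | 아산시 | 깨끗한나라 보솜이
95 | MT | 201802 | 77 | 3 | 기저귀 | 전체 | 전체 | 1 | 30 | 충청남도 | 서산시 | 깨끗한나라 보솜이
96 | MT | 201802 | 151 | 12 | 기저귀 | 전체 | 전체 | 1 | 30 | 충청남도 | 당진시 | 깨끗한나라 보솜이
97 | MT | 201802 | 165 | 10 | 기저귀 | 전체 | 전체 | 1 | 30 | 충청남도 | 홍성군 | 깨끗한나라 보솜이
98 | MT | 201802 | 70 | 3 | 기저귀 | 전체 | 전체 | 1 | 30 | 전라남도 | 여수시 | 깨끗한나라 보솜이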
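
The column names in this sample (MT, 201802, 108, 7, ...) look like data values, and the suffixes .1, .2, .3 on 전체 match how pandas renames duplicate header names. Both observations are consistent with a headerless file whose first record was consumed as the header, which would also explain a count of 99 observations if the file holds 100 records. That reading is an inference, not something the report states. The sketch below illustrates the effect with the standard csv module on three records. The comma delimiter, the exact first record, and the names col_01 to col_12 are all assumptions made for the example.

```python
import csv
import io

# Hypothetical excerpt of the raw file: no header line, comma-separated.
# The first record is assumed to be the one that became the column names.
raw = (
    "MT,201802,108,7,기저귀,전체,전체,1,20,전체,전체,깨끗한나라 보솜이\n"
    "MT,201802,36,3,기저귀,전체,전체,1,20,광주광역시,서구,깨끗한나라 보솜이\n"
    "MT,201802,139,63,기저귀,전체,전체,1,20,울산광역시,전체,깨끗한나라 보솜이\n"
)


def read_with_header(text):
    """Treat the first line as column names, as a default reader would."""
    reader = csv.reader(io.StringIO(text))
    header = next(reader)
    return header, list(reader)


def read_headerless(text, names):
    """Keep every line as data and attach caller-supplied column names."""
    reader = csv.reader(io.StringIO(text))
    return names, list(reader)


# Placeholder names; the real meaning of each column is not given in the report.
names = [f"col_{i:02d}" for i in range(1, 13)]

header_a, rows_a = read_with_header(raw)
header_b, rows_b = read_headerless(raw, names)

print(len(rows_a), header_a[:4])  # first record lost to the header
print(len(rows_b), header_b[:4])  # all records kept as data
```

If the inference holds, re-reading the source with explicit column names (in pandas, `header=None` together with `names=`) before profiling would restore the missing record and give the variables meaningful labels.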