Overview

Dataset statistics

Number of variables6
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows19
Duplicate rows (%)19.0%
Total size in memory5.0 KiB
Average record size in memory51.3 B

Variable types

Categorical5
Numeric1

Dataset

DescriptionSample
Author(주)넥스트이지
URLhttps://www.bigdata-telecom.kr/invoke/SOKBP2603/?goodsCode=NXEJEJUGFRSVA

Alerts

분석항목명 has constant value ""Constant
Dataset has 19 (19.0%) duplicate rowsDuplicates
총건수 is highly imbalanced (63.5%)Imbalance

Reproduction

Analysis started2023-12-10 06:46:16.693920
Analysis finished2023-12-10 06:46:17.277155
Duration0.58 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

분석대상명
Categorical

Distinct5
Distinct (%)5.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
아덴힐GC
62 
더클래식CC
22 
해비치CC
10 
오라CC
 
4
그린필드CC(구 제피로스)
 
2

Length

Max length14
Median length5
Mean length5.4
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row해비치CC
2nd row아덴힐GC
3rd row아덴힐GC
4th row아덴힐GC
5th row더클래식CC

Common Values

ValueCountFrequency (%)
아덴힐GC 62
62.0%
더클래식CC 22
 
22.0%
해비치CC 10
 
10.0%
오라CC 4
 
4.0%
그린필드CC(구 제피로스) 2
 
2.0%

Length

2023-12-10T15:46:17.363363image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T15:46:17.522621image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
아덴힐gc 62
60.8%
더클래식cc 22
 
21.6%
해비치cc 10
 
9.8%
오라cc 4
 
3.9%
그린필드cc(구 2
 
2.0%
제피로스 2
 
2.0%

분석항목명
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
시간대별
100 

Length

Max length4
Median length4
Mean length4
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row시간대별
2nd row시간대별
3rd row시간대별
4th row시간대별
5th row시간대별

Common Values

ValueCountFrequency (%)
시간대별 100
100.0%

Length

2023-12-10T15:46:17.671307image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T15:46:17.799811image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
시간대별 100
100.0%
Distinct4
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
13시-18시
39 
08시-12시
31 
19시-24시
23 
01시-07시

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row08시-12시
2nd row08시-12시
3rd row13시-18시
4th row13시-18시
5th row08시-12시

Common Values

ValueCountFrequency (%)
13시-18시 39
39.0%
08시-12시 31
31.0%
19시-24시 23
23.0%
01시-07시 7
 
7.0%

Length

2023-12-10T15:46:17.929574image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T15:46:18.076332image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
13시-18시 39
39.0%
08시-12시 31
31.0%
19시-24시 23
23.0%
01시-07시 7
 
7.0%

총건수
Categorical

IMBALANCE 

Distinct3
Distinct (%)3.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
4
88 
3
11 
2
 
1

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique1 ?
Unique (%)1.0%

Sample

1st row4
2nd row4
3rd row4
4th row4
5th row4

Common Values

ValueCountFrequency (%)
4 88
88.0%
3 11
 
11.0%
2 1
 
1.0%

Length

2023-12-10T15:46:18.201491image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T15:46:18.306011image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
4 88
88.0%
3 11
 
11.0%
2 1
 
1.0%

기준일자
Real number (ℝ)

Distinct36
Distinct (%)36.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20210182
Minimum20210102
Maximum20210310
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2023-12-10T15:46:18.433951image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum20210102
5-th percentile20210104
Q120210121
median20210208
Q320210219
95-th percentile20210303
Maximum20210310
Range208
Interquartile range (IQR)98

Descriptive statistics

Standard deviation59.175598
Coefficient of variation (CV)2.9280091 × 10-6
Kurtosis-0.87196979
Mean20210182
Median Absolute Deviation (MAD)17
Skewness0.12009581
Sum2.0210182 × 109
Variance3501.7514
MonotonicityNot monotonic
2023-12-10T15:46:18.598393image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=36)
ValueCountFrequency (%)
20210216 14
 
14.0%
20210123 7
 
7.0%
20210224 6
 
6.0%
20210203 6
 
6.0%
20210111 6
 
6.0%
20210208 6
 
6.0%
20210104 4
 
4.0%
20210219 4
 
4.0%
20210114 3
 
3.0%
20210211 3
 
3.0%
Other values (26) 41
41.0%
ValueCountFrequency (%)
20210102 2
 
2.0%
20210103 2
 
2.0%
20210104 4
4.0%
20210105 2
 
2.0%
20210109 1
 
1.0%
20210110 1
 
1.0%
20210111 6
6.0%
20210113 1
 
1.0%
20210114 3
3.0%
20210120 1
 
1.0%
ValueCountFrequency (%)
20210310 1
 
1.0%
20210305 1
 
1.0%
20210304 3
3.0%
20210303 1
 
1.0%
20210302 1
 
1.0%
20210228 1
 
1.0%
20210227 1
 
1.0%
20210225 3
3.0%
20210224 6
6.0%
20210222 3
3.0%

요일명
Categorical

Distinct7
Distinct (%)7.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
21 
18 
17 
15 
12 
Other values (2)
17 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row
2nd row
3rd row
4th row
5th row

Common Values

ValueCountFrequency (%)
21
21.0%
18
18.0%
17
17.0%
15
15.0%
12
12.0%
11
11.0%
6
 
6.0%

Length

2023-12-10T15:46:18.727858image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T15:46:18.866923image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
21
21.0%
18
18.0%
17
17.0%
15
15.0%
12
12.0%
11
11.0%
6
 
6.0%

Interactions

2023-12-10T15:46:16.954596image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T15:46:18.981357image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
분석대상명자료카테고리명총건수기준일자요일명
분석대상명1.0000.1550.0000.6290.361
자료카테고리명0.1551.0000.0000.2940.284
총건수0.0000.0001.0000.7080.313
기준일자0.6290.2940.7081.0000.558
요일명0.3610.2840.3130.5581.000
2023-12-10T15:46:19.081309image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
총건수요일명자료카테고리명분석대상명
총건수1.0000.2160.0000.000
요일명0.2161.0000.1940.237
자료카테고리명0.0000.1941.0000.125
분석대상명0.0000.2370.1251.000
2023-12-10T15:46:19.190866image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
기준일자분석대상명자료카테고리명총건수요일명
기준일자1.0000.4850.1740.3830.351
분석대상명0.4851.0000.1250.0000.237
자료카테고리명0.1740.1251.0000.0000.194
총건수0.3830.0000.0001.0000.216
요일명0.3510.2370.1940.2161.000

Missing values

2023-12-10T15:46:17.094176image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T15:46:17.222548image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

분석대상명분석항목명자료카테고리명총건수기준일자요일명
0해비치CC시간대별08시-12시420210114
1아덴힐GC시간대별08시-12시420210114
2아덴힐GC시간대별13시-18시420210124
3아덴힐GC시간대별13시-18시420210124
4더클래식CC시간대별08시-12시420210125
5아덴힐GC시간대별13시-18시420210214
6아덴힐GC시간대별13시-18시420210224
7아덴힐GC시간대별13시-18시420210224
8더클래식CC시간대별19시-24시420210310
9더클래식CC시간대별19시-24시420210120
분석대상명분석항목명자료카테고리명총건수기준일자요일명
90아덴힐GC시간대별08시-12시420210216
91아덴힐GC시간대별13시-18시420210216
92더클래식CC시간대별13시-18시420210216
93아덴힐GC시간대별19시-24시420210216
94아덴힐GC시간대별19시-24시420210216
95아덴힐GC시간대별19시-24시420210216
96아덴힐GC시간대별19시-24시420210216
97아덴힐GC시간대별08시-12시420210302
98해비치CC시간대별08시-12시320210109
99아덴힐GC시간대별08시-12시320210110

Duplicate rows

Most frequently occurring

분석대상명분석항목명자료카테고리명총건수기준일자요일명# duplicates
14아덴힐GC시간대별19시-24시4202102165
11아덴힐GC시간대별13시-18시4202102164
12아덴힐GC시간대별13시-18시4202102244
18해비치CC시간대별13시-18시4202101234
2아덴힐GC시간대별08시-12시4202102033
3아덴힐GC시간대별08시-12시4202102083
4아덴힐GC시간대별08시-12시4202102163
7아덴힐GC시간대별13시-18시4202101233
9아덴힐GC시간대별13시-18시4202102083
17해비치CC시간대별13시-18시4202101113