Overview

Dataset statistics

Number of variables6
Number of observations400
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory20.1 KiB
Average record size in memory51.3 B

Variable types

Numeric3
Categorical2
DateTime1

Dataset

DescriptionSample
Author코난테크놀로지
URLhttps://www.bigdata-telecom.kr/invoke/SOKBP2603/?goodsCode=TPOTIME

Alerts

"채널값" has constant value ""Constant
"차례값" is highly overall correlated with "건수값" and 1 other fieldsHigh correlation
"건수값" is highly overall correlated with "차례값"High correlation
"이슈어값" is highly overall correlated with "차례값"High correlation
"기본키값" has unique valuesUnique

Reproduction

Analysis started2023-12-10 06:14:01.122083
Analysis finished2023-12-10 06:14:02.946893
Duration1.82 second
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

"기본키값"
Real number (ℝ)

UNIQUE 

Distinct400
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean140200.84
Minimum16441
Maximum279667
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.6 KiB
2023-12-10T15:14:03.083827image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum16441
5-th percentile16460.95
Q165720.75
median133901.5
Q3213305.25
95-th percentile263843.05
Maximum279667
Range263226
Interquartile range (IQR)147584.5

Descriptive statistics

Standard deviation79534.028
Coefficient of variation (CV)0.56728639
Kurtosis-1.2048663
Mean140200.84
Median Absolute Deviation (MAD)68195
Skewness0.10392607
Sum56080336
Variance6.3256616 × 109
MonotonicityStrictly increasing
2023-12-10T15:14:03.320420image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
16441 1
 
0.2%
183219 1
 
0.2%
183229 1
 
0.2%
183228 1
 
0.2%
183227 1
 
0.2%
183226 1
 
0.2%
183225 1
 
0.2%
183224 1
 
0.2%
183223 1
 
0.2%
183222 1
 
0.2%
Other values (390) 390
97.5%
ValueCountFrequency (%)
16441 1
0.2%
16442 1
0.2%
16443 1
0.2%
16444 1
0.2%
16445 1
0.2%
16446 1
0.2%
16447 1
0.2%
16448 1
0.2%
16449 1
0.2%
16450 1
0.2%
ValueCountFrequency (%)
279667 1
0.2%
279666 1
0.2%
279665 1
0.2%
279664 1
0.2%
279663 1
0.2%
279662 1
0.2%
279661 1
0.2%
279660 1
0.2%
279659 1
0.2%
279658 1
0.2%

"채널값"
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
"블로그"
400 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row"블로그"
2nd row"블로그"
3rd row"블로그"
4th row"블로그"
5th row"블로그"

Common Values

ValueCountFrequency (%)
"블로그" 400
100.0%

Length

2023-12-10T15:14:03.583981image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T15:14:03.758868image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
블로그 400
100.0%
Distinct16
Distinct (%)4.0%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
Minimum2020-05-01 00:00:00
Maximum2020-05-17 00:00:00
2023-12-10T15:14:03.895259image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:14:04.419196image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=16)

"차례값"
Real number (ℝ)

HIGH CORRELATION 

Distinct26
Distinct (%)6.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13.3
Minimum1
Maximum26
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.6 KiB
2023-12-10T15:14:04.618794image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q17
median13
Q320
95-th percentile25
Maximum26
Range25
Interquartile range (IQR)13

Descriptive statistics

Standard deviation7.5333843
Coefficient of variation (CV)0.56641988
Kurtosis-1.2134051
Mean13.3
Median Absolute Deviation (MAD)7
Skewness0.038234162
Sum5320
Variance56.75188
MonotonicityNot monotonic
2023-12-10T15:14:04.828140image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=26)
ValueCountFrequency (%)
1 16
 
4.0%
7 16
 
4.0%
2 16
 
4.0%
9 16
 
4.0%
8 16
 
4.0%
10 16
 
4.0%
6 16
 
4.0%
5 16
 
4.0%
4 16
 
4.0%
3 16
 
4.0%
Other values (16) 240
60.0%
ValueCountFrequency (%)
1 16
4.0%
2 16
4.0%
3 16
4.0%
4 16
4.0%
5 16
4.0%
6 16
4.0%
7 16
4.0%
8 16
4.0%
9 16
4.0%
10 16
4.0%
ValueCountFrequency (%)
26 15
3.8%
25 15
3.8%
24 15
3.8%
23 15
3.8%
22 15
3.8%
21 15
3.8%
20 15
3.8%
19 15
3.8%
18 15
3.8%
17 15
3.8%

"이슈어값"
Categorical

HIGH CORRELATION 

Distinct26
Distinct (%)6.5%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
"점심시간"
 
16
"오후9시"
 
16
"오후8시"
 
16
"저녁시간"
 
16
"오후6시"
 
16
Other values (21)
320 

Length

Max length7
Median length6
Mean length6
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row"점심시간"
2nd row"오후10시"
3rd row"자정"
4th row"오후6시"
5th row"오후9시"

Common Values

ValueCountFrequency (%)
"점심시간" 16
 
4.0%
"오후9시" 16
 
4.0%
"오후8시" 16
 
4.0%
"저녁시간" 16
 
4.0%
"오후6시" 16
 
4.0%
"자정" 16
 
4.0%
"오후3시" 16
 
4.0%
"오후5시" 16
 
4.0%
"오전10시" 16
 
4.0%
"오후10시" 16
 
4.0%
Other values (16) 240
60.0%

Length

2023-12-10T15:14:05.047100image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
점심시간 16
 
4.0%
오후3시 16
 
4.0%
오후9시 16
 
4.0%
오후10시 16
 
4.0%
오후5시 16
 
4.0%
오전10시 16
 
4.0%
자정 16
 
4.0%
오후6시 16
 
4.0%
저녁시간 16
 
4.0%
오후8시 16
 
4.0%
Other values (16) 240
60.0%

"건수값"
Real number (ℝ)

HIGH CORRELATION 

Distinct355
Distinct (%)88.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1276.6375
Minimum263
Maximum5728
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.6 KiB
2023-12-10T15:14:05.262199image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum263
5-th percentile322
Q1451
median1321.5
Q31756
95-th percentile2608.85
Maximum5728
Range5465
Interquartile range (IQR)1305

Descriptive statistics

Standard deviation813.68312
Coefficient of variation (CV)0.63736427
Kurtosis2.4025502
Mean1276.6375
Median Absolute Deviation (MAD)690.5
Skewness1.0029617
Sum510655
Variance662080.23
MonotonicityNot monotonic
2023-12-10T15:14:05.517640image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
384 3
 
0.8%
328 3
 
0.8%
1941 2
 
0.5%
326 2
 
0.5%
538 2
 
0.5%
378 2
 
0.5%
303 2
 
0.5%
300 2
 
0.5%
2040 2
 
0.5%
287 2
 
0.5%
Other values (345) 378
94.5%
ValueCountFrequency (%)
263 1
0.2%
264 1
0.2%
273 1
0.2%
287 2
0.5%
290 1
0.2%
294 1
0.2%
299 1
0.2%
300 2
0.5%
303 2
0.5%
304 1
0.2%
ValueCountFrequency (%)
5728 1
0.2%
4437 1
0.2%
4339 1
0.2%
4116 1
0.2%
3666 1
0.2%
3478 1
0.2%
3379 1
0.2%
3190 1
0.2%
3045 1
0.2%
2931 1
0.2%

Interactions

2023-12-10T15:14:02.270378image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:14:01.461977image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:14:01.854514image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:14:02.412723image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:14:01.583335image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:14:01.994083image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:14:02.551043image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:14:01.715971image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:14:02.129357image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T15:14:05.667546image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
"기본키값""해당일자""차례값""이슈어값""건수값"
"기본키값"1.0001.0000.0000.0000.319
"해당일자"1.0001.0000.0000.0000.283
"차례값"0.0000.0001.0000.8850.739
"이슈어값"0.0000.0000.8851.0000.815
"건수값"0.3190.2830.7390.8151.000
2023-12-10T15:14:05.815853image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
"기본키값""차례값""건수값""이슈어값"
"기본키값"1.0000.018-0.1320.000
"차례값"0.0181.000-0.9450.573
"건수값"-0.132-0.9451.0000.465
"이슈어값"0.0000.5730.4651.000

Missing values

2023-12-10T15:14:02.730385image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T15:14:02.887931image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

"기본키값""채널값""해당일자""차례값""이슈어값""건수값"
016441"블로그"2020-05-011"점심시간"2514
116442"블로그"2020-05-012"오후10시"2468
216443"블로그"2020-05-013"자정"2440
316444"블로그"2020-05-014"오후6시"2190
416445"블로그"2020-05-015"오후9시"2136
516446"블로그"2020-05-016"오후8시"2135
616447"블로그"2020-05-017"오후5시"1952
716448"블로그"2020-05-018"오전10시"1928
816449"블로그"2020-05-019"오후3시"1755
916450"블로그"2020-05-0110"오후11시"1734
"기본키값""채널값""해당일자""차례값""이슈어값""건수값"
390279658"블로그"2020-05-171"자정"2599
391279659"블로그"2020-05-172"점심시간"2178
392279660"블로그"2020-05-173"오후8시"1500
393279661"블로그"2020-05-174"오후10시"1428
394279662"블로그"2020-05-175"오후6시"1308
395279663"블로그"2020-05-176"오후5시"1241
396279664"블로그"2020-05-177"오후3시"1149
397279665"블로그"2020-05-178"오후9시"1131
398279666"블로그"2020-05-179"저녁시간"1111
399279667"블로그"2020-05-1710"오전10시"1079