Overview

Dataset statistics

Number of variables6
Number of observations400
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory20.1 KiB
Average record size in memory51.3 B

Variable types

Numeric3
Categorical2
DateTime1

Dataset

DescriptionSample
Author코난테크놀로지
URLhttps://www.bigdata-telecom.kr/invoke/SOKBP2603/?goodsCode=TPOMOMENT

Alerts

"채널값" has constant value ""Constant
"차례값" is highly overall correlated with "건수값" and 1 other fieldsHigh correlation
"건수값" is highly overall correlated with "차례값"High correlation
"이슈어값" is highly overall correlated with "차례값"High correlation
"기본키값" has unique valuesUnique

Reproduction

Analysis started2023-12-10 06:12:18.813735
Analysis finished2023-12-10 06:12:24.833144
Duration6.02 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

"기본키값"
Real number (ℝ)

UNIQUE 

Distinct400
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean86894.49
Minimum16098
Maximum165420
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.6 KiB
2023-12-10T15:12:25.006152image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum16098
5-th percentile16117.95
Q148002.75
median81925.5
Q3133548.25
95-th percentile165400.05
Maximum165420
Range149322
Interquartile range (IQR)85545.5

Descriptive statistics

Standard deviation46278.476
Coefficient of variation (CV)0.5325824
Kurtosis-1.2208544
Mean86894.49
Median Absolute Deviation (MAD)34260
Skewness0.03676651
Sum34757796
Variance2.1416974 × 109
MonotonicityStrictly increasing
2023-12-10T15:12:25.230798image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
16098 1
 
0.2%
116166 1
 
0.2%
116176 1
 
0.2%
116175 1
 
0.2%
116174 1
 
0.2%
116173 1
 
0.2%
116172 1
 
0.2%
116171 1
 
0.2%
116170 1
 
0.2%
116169 1
 
0.2%
Other values (390) 390
97.5%
ValueCountFrequency (%)
16098 1
0.2%
16099 1
0.2%
16100 1
0.2%
16101 1
0.2%
16102 1
0.2%
16103 1
0.2%
16104 1
0.2%
16105 1
0.2%
16106 1
0.2%
16107 1
0.2%
ValueCountFrequency (%)
165420 1
0.2%
165419 1
0.2%
165418 1
0.2%
165417 1
0.2%
165416 1
0.2%
165415 1
0.2%
165414 1
0.2%
165413 1
0.2%
165412 1
0.2%
165411 1
0.2%

"채널값"
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
"블로그"
400 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row"블로그"
2nd row"블로그"
3rd row"블로그"
4th row"블로그"
5th row"블로그"

Common Values

ValueCountFrequency (%)
"블로그" 400
100.0%

Length

2023-12-10T15:12:25.454612image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T15:12:25.629531image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
블로그 400
100.0%
Distinct10
Distinct (%)2.5%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
Minimum2020-05-01 00:00:00
Maximum2020-05-10 00:00:00
2023-12-10T15:12:25.783491image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:12:25.960585image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=10)

"차례값"
Real number (ℝ)

HIGH CORRELATION 

Distinct42
Distinct (%)10.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20.95
Minimum1
Maximum42
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.6 KiB
2023-12-10T15:12:26.171017image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2.95
Q110.75
median20.5
Q331
95-th percentile40
Maximum42
Range41
Interquartile range (IQR)20.25

Descriptive statistics

Standard deviation12.108428
Coefficient of variation (CV)0.57796794
Kurtosis-1.189319
Mean20.95
Median Absolute Deviation (MAD)10.5
Skewness0.068140791
Sum8380
Variance146.61404
MonotonicityNot monotonic
2023-12-10T15:12:26.441661image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=42)
ValueCountFrequency (%)
1 10
 
2.5%
13 10
 
2.5%
2 10
 
2.5%
21 10
 
2.5%
20 10
 
2.5%
19 10
 
2.5%
18 10
 
2.5%
17 10
 
2.5%
16 10
 
2.5%
15 10
 
2.5%
Other values (32) 300
75.0%
ValueCountFrequency (%)
1 10
2.5%
2 10
2.5%
3 10
2.5%
4 10
2.5%
5 10
2.5%
6 10
2.5%
7 10
2.5%
8 10
2.5%
9 10
2.5%
10 10
2.5%
ValueCountFrequency (%)
42 9
2.2%
41 9
2.2%
40 9
2.2%
39 9
2.2%
38 9
2.2%
37 9
2.2%
36 9
2.2%
35 9
2.2%
34 9
2.2%
33 9
2.2%

"이슈어값"
Categorical

HIGH CORRELATION 

Distinct42
Distinct (%)10.5%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
"주말"
 
10
"기념일"
 
10
"스승의날"
 
10
"봄(春)"
 
10
"어린이날"
 
10
Other values (37)
350 

Length

Max length9
Median length8
Mean length5.465
Min length4

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row"주말"
2nd row"여름"
3rd row"겨울"
4th row"평일"
5th row"생일"

Common Values

ValueCountFrequency (%)
"주말" 10
 
2.5%
"기념일" 10
 
2.5%
"스승의날" 10
 
2.5%
"봄(春)" 10
 
2.5%
"어린이날" 10
 
2.5%
"어버이날" 10
 
2.5%
"겨울" 10
 
2.5%
"생일" 10
 
2.5%
"평일" 10
 
2.5%
"연말" 10
 
2.5%
Other values (32) 300
75.0%

Length

2023-12-10T15:12:26.678414image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
주말 10
 
2.5%
12월31일 10
 
2.5%
기념일 10
 
2.5%
100일 10
 
2.5%
설날 10
 
2.5%
연초 10
 
2.5%
여름 10
 
2.5%
졸업식 10
 
2.5%
화이트데이 10
 
2.5%
크리스마스 10
 
2.5%
Other values (32) 300
75.0%

"건수값"
Real number (ℝ)

HIGH CORRELATION 

Distinct313
Distinct (%)78.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1964.025
Minimum1
Maximum22043
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.6 KiB
2023-12-10T15:12:26.952460image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile13.85
Q178.75
median244
Q31325.75
95-th percentile11357.2
Maximum22043
Range22042
Interquartile range (IQR)1247

Descriptive statistics

Standard deviation3890.1952
Coefficient of variation (CV)1.9807259
Kurtosis7.4990701
Mean1964.025
Median Absolute Deviation (MAD)213
Skewness2.7480073
Sum785610
Variance15133619
MonotonicityNot monotonic
2023-12-10T15:12:27.394148image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
8 5
 
1.2%
10 5
 
1.2%
118 4
 
1.0%
41 4
 
1.0%
120 4
 
1.0%
55 4
 
1.0%
44 3
 
0.8%
48 3
 
0.8%
103 3
 
0.8%
74 3
 
0.8%
Other values (303) 362
90.5%
ValueCountFrequency (%)
1 1
 
0.2%
3 1
 
0.2%
4 2
 
0.5%
5 2
 
0.5%
6 1
 
0.2%
7 2
 
0.5%
8 5
1.2%
10 5
1.2%
11 1
 
0.2%
14 2
 
0.5%
ValueCountFrequency (%)
22043 1
0.2%
19616 1
0.2%
18698 1
0.2%
18364 1
0.2%
18219 1
0.2%
17217 1
0.2%
16340 1
0.2%
16053 1
0.2%
15851 1
0.2%
15679 1
0.2%

Interactions

2023-12-10T15:12:23.801012image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:12:22.627401image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:12:23.291130image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:12:23.939445image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:12:22.826220image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:12:23.448458image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:12:24.076618image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:12:23.135782image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:12:23.632977image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T15:12:27.732267image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
"기본키값""해당일자""차례값""이슈어값""건수값"
"기본키값"1.0001.0000.0000.0000.000
"해당일자"1.0001.0000.0000.0000.000
"차례값"0.0000.0001.0000.9600.821
"이슈어값"0.0000.0000.9601.0000.855
"건수값"0.0000.0000.8210.8551.000
2023-12-10T15:12:28.689214image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
"기본키값""차례값""건수값""이슈어값"
"기본키값"1.0000.027-0.0570.000
"차례값"0.0271.000-0.9950.732
"건수값"-0.057-0.9951.0000.482
"이슈어값"0.0000.7320.4821.000

Missing values

2023-12-10T15:12:24.258737image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T15:12:24.519392image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

"기본키값""채널값""해당일자""차례값""이슈어값""건수값"
016098"블로그"2020-05-011"주말"16340
116099"블로그"2020-05-012"여름"14848
216100"블로그"2020-05-013"겨울"9290
316101"블로그"2020-05-014"평일"7469
416102"블로그"2020-05-015"생일"5226
516103"블로그"2020-05-016"가을"4164
616104"블로그"2020-05-017"어버이날"4053
716105"블로그"2020-05-018"어린이날"3715
816106"블로그"2020-05-019"봄(春)"2548
916107"블로그"2020-05-0110"석가탄신일"2342
"기본키값""채널값""해당일자""차례값""이슈어값""건수값"
390165411"블로그"2020-05-1013"스승의날"754
391165412"블로그"2020-05-1014"추석"583
392165413"블로그"2020-05-1015"100일"504
393165414"블로그"2020-05-1016"설날"298
394165415"블로그"2020-05-1017"결혼기념일"291
395165416"블로그"2020-05-1018"연초"280
396165417"블로그"2020-05-1019"석가탄신일"277
397165418"블로그"2020-05-1020"12월31일"178
398165419"블로그"2020-05-1021"졸업식"147
399165420"블로그"2020-05-1022"화이트데이"138