Overview

Dataset statistics

Number of variables5
Number of observations400
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory16.9 KiB
Average record size in memory43.3 B

Variable types

Numeric3
Categorical1
Text1

Dataset

DescriptionSample
Author코난테크놀로지
URLhttps://www.bigdata-telecom.kr/invoke/SOKBP2603/?goodsCode=TOPICBRAND

Alerts

"기본키값" is highly overall correlated with "해당일시"High correlation
"차례값" is highly overall correlated with "건수값"High correlation
"건수값" is highly overall correlated with "차례값"High correlation
"해당일시" is highly overall correlated with "기본키값"High correlation
"기본키값" has unique valuesUnique

Reproduction

Analysis started2023-12-10 06:35:36.972609
Analysis finished2023-12-10 06:35:39.336546
Duration2.36 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

"기본키값"
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct400
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean64910.5
Minimum64711
Maximum65110
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.6 KiB
2023-12-10T15:35:39.487393image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum64711
5-th percentile64730.95
Q164810.75
median64910.5
Q365010.25
95-th percentile65090.05
Maximum65110
Range399
Interquartile range (IQR)199.5

Descriptive statistics

Standard deviation115.6143
Coefficient of variation (CV)0.001781134
Kurtosis-1.2
Mean64910.5
Median Absolute Deviation (MAD)100
Skewness0
Sum25964200
Variance13366.667
MonotonicityStrictly increasing
2023-12-10T15:35:39.774286image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
64711 1
 
0.2%
64975 1
 
0.2%
64985 1
 
0.2%
64984 1
 
0.2%
64983 1
 
0.2%
64982 1
 
0.2%
64981 1
 
0.2%
64980 1
 
0.2%
64979 1
 
0.2%
64978 1
 
0.2%
Other values (390) 390
97.5%
ValueCountFrequency (%)
64711 1
0.2%
64712 1
0.2%
64713 1
0.2%
64714 1
0.2%
64715 1
0.2%
64716 1
0.2%
64717 1
0.2%
64718 1
0.2%
64719 1
0.2%
64720 1
0.2%
ValueCountFrequency (%)
65110 1
0.2%
65109 1
0.2%
65108 1
0.2%
65107 1
0.2%
65106 1
0.2%
65105 1
0.2%
65104 1
0.2%
65103 1
0.2%
65102 1
0.2%
65101 1
0.2%

"해당일시"
Categorical

HIGH CORRELATION 

Distinct40
Distinct (%)10.0%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
2020-10-01 00:00:00
 
10
2020-10-01 01:00:00
 
10
2020-10-01 09:00:00
 
10
2020-10-01 02:00:00
 
10
2020-10-01 03:00:00
 
10
Other values (35)
350 

Length

Max length19
Median length19
Mean length19
Min length19

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2020-10-01 00:00:00
2nd row2020-10-01 00:00:00
3rd row2020-10-01 00:00:00
4th row2020-10-01 00:00:00
5th row2020-10-01 00:00:00

Common Values

ValueCountFrequency (%)
2020-10-01 00:00:00 10
 
2.5%
2020-10-01 01:00:00 10
 
2.5%
2020-10-01 09:00:00 10
 
2.5%
2020-10-01 02:00:00 10
 
2.5%
2020-10-01 03:00:00 10
 
2.5%
2020-10-01 04:00:00 10
 
2.5%
2020-10-01 05:00:00 10
 
2.5%
2020-10-01 06:00:00 10
 
2.5%
2020-10-01 07:00:00 10
 
2.5%
2020-10-01 08:00:00 10
 
2.5%
Other values (30) 300
75.0%

Length

2023-12-10T15:35:40.013872image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2020-10-01 240
30.0%
2020-10-02 160
20.0%
08:00:00 20
 
2.5%
15:00:00 20
 
2.5%
14:00:00 20
 
2.5%
13:00:00 20
 
2.5%
12:00:00 20
 
2.5%
00:00:00 20
 
2.5%
10:00:00 20
 
2.5%
11:00:00 20
 
2.5%
Other values (16) 240
30.0%

"차례값"
Real number (ℝ)

HIGH CORRELATION 

Distinct10
Distinct (%)2.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.5
Minimum1
Maximum10
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.6 KiB
2023-12-10T15:35:40.181057image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q13
median5.5
Q38
95-th percentile10
Maximum10
Range9
Interquartile range (IQR)5

Descriptive statistics

Standard deviation2.8758784
Coefficient of variation (CV)0.52288699
Kurtosis-1.224533
Mean5.5
Median Absolute Deviation (MAD)2.5
Skewness0
Sum2200
Variance8.2706767
MonotonicityNot monotonic
2023-12-10T15:35:40.768926image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%)
1 40
10.0%
2 40
10.0%
3 40
10.0%
4 40
10.0%
5 40
10.0%
6 40
10.0%
7 40
10.0%
8 40
10.0%
9 40
10.0%
10 40
10.0%
ValueCountFrequency (%)
1 40
10.0%
2 40
10.0%
3 40
10.0%
4 40
10.0%
5 40
10.0%
6 40
10.0%
7 40
10.0%
8 40
10.0%
9 40
10.0%
10 40
10.0%
ValueCountFrequency (%)
10 40
10.0%
9 40
10.0%
8 40
10.0%
7 40
10.0%
6 40
10.0%
5 40
10.0%
4 40
10.0%
3 40
10.0%
2 40
10.0%
1 40
10.0%
Distinct153
Distinct (%)38.2%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
2023-12-10T15:35:41.337571image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length8
Mean length3.58
Min length2

Characters and Unicode

Total characters1432
Distinct characters238
Distinct categories3 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique71 ?
Unique (%)17.8%

Sample

1st row카누
2nd row아이패드
3rd row레고
4th row투썸플레이스
5th row리디북스
ValueCountFrequency (%)
루이비통 12
 
3.0%
아이패드 11
 
2.8%
cu편의점 9
 
2.2%
프라다 9
 
2.2%
샤넬 9
 
2.2%
구찌 8
 
2.0%
리디북스 8
 
2.0%
버버리 8
 
2.0%
갤럭시 8
 
2.0%
스타벅스 8
 
2.0%
Other values (143) 310
77.5%
2023-12-10T15:35:42.284195image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
92
 
6.4%
85
 
5.9%
38
 
2.7%
38
 
2.7%
29
 
2.0%
29
 
2.0%
25
 
1.7%
25
 
1.7%
24
 
1.7%
23
 
1.6%
Other values (228) 1024
71.5%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1379
96.3%
Lowercase Letter 52
 
3.6%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
92
 
6.7%
85
 
6.2%
38
 
2.8%
38
 
2.8%
29
 
2.1%
29
 
2.1%
25
 
1.8%
25
 
1.8%
24
 
1.7%
23
 
1.7%
Other values (214) 971
70.4%
Lowercase Letter
ValueCountFrequency (%)
c 12
23.1%
u 9
17.3%
a 5
9.6%
m 5
9.6%
b 4
 
7.7%
w 4
 
7.7%
g 3
 
5.8%
v 2
 
3.8%
h 2
 
3.8%
r 2
 
3.8%
Other values (3) 4
 
7.7%
Other Punctuation
ValueCountFrequency (%)
& 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1379
96.3%
Latin 52
 
3.6%
Common 1
 
0.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
92
 
6.7%
85
 
6.2%
38
 
2.8%
38
 
2.8%
29
 
2.1%
29
 
2.1%
25
 
1.8%
25
 
1.8%
24
 
1.7%
23
 
1.7%
Other values (214) 971
70.4%
Latin
ValueCountFrequency (%)
c 12
23.1%
u 9
17.3%
a 5
9.6%
m 5
9.6%
b 4
 
7.7%
w 4
 
7.7%
g 3
 
5.8%
v 2
 
3.8%
h 2
 
3.8%
r 2
 
3.8%
Other values (3) 4
 
7.7%
Common
ValueCountFrequency (%)
& 1
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1379
96.3%
ASCII 53
 
3.7%

Most frequent character per block

Hangul
ValueCountFrequency (%)
92
 
6.7%
85
 
6.2%
38
 
2.8%
38
 
2.8%
29
 
2.1%
29
 
2.1%
25
 
1.8%
25
 
1.8%
24
 
1.7%
23
 
1.7%
Other values (214) 971
70.4%
ASCII
ValueCountFrequency (%)
c 12
22.6%
u 9
17.0%
a 5
9.4%
m 5
9.4%
b 4
 
7.5%
w 4
 
7.5%
g 3
 
5.7%
v 2
 
3.8%
h 2
 
3.8%
r 2
 
3.8%
Other values (4) 5
9.4%

"건수값"
Real number (ℝ)

HIGH CORRELATION 

Distinct22
Distinct (%)5.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.66
Minimum1
Maximum26
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.6 KiB
2023-12-10T15:35:42.562314image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q12
median4
Q35
95-th percentile12.05
Maximum26
Range25
Interquartile range (IQR)3

Descriptive statistics

Standard deviation3.8106039
Coefficient of variation (CV)0.81772615
Kurtosis9.0810643
Mean4.66
Median Absolute Deviation (MAD)1
Skewness2.6851393
Sum1864
Variance14.520702
MonotonicityNot monotonic
2023-12-10T15:35:42.746455image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=22)
ValueCountFrequency (%)
3 97
24.2%
2 77
19.2%
4 74
18.5%
5 31
 
7.8%
6 28
 
7.0%
1 24
 
6.0%
7 15
 
3.8%
10 14
 
3.5%
8 12
 
3.0%
13 4
 
1.0%
Other values (12) 24
 
6.0%
ValueCountFrequency (%)
1 24
 
6.0%
2 77
19.2%
3 97
24.2%
4 74
18.5%
5 31
 
7.8%
6 28
 
7.0%
7 15
 
3.8%
8 12
 
3.0%
9 3
 
0.8%
10 14
 
3.5%
ValueCountFrequency (%)
26 1
 
0.2%
25 2
0.5%
22 1
 
0.2%
20 2
0.5%
19 2
0.5%
18 1
 
0.2%
16 2
0.5%
15 3
0.8%
14 2
0.5%
13 4
1.0%

Interactions

2023-12-10T15:35:38.353615image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:35:37.326150image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:35:37.833908image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:35:38.613635image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:35:37.515196image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:35:38.016997image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:35:38.864389image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:35:37.665668image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:35:38.167816image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T15:35:42.875086image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
"기본키값""해당일시""차례값""건수값"
"기본키값"1.0001.0000.0000.484
"해당일시"1.0001.0000.0000.616
"차례값"0.0000.0001.0000.586
"건수값"0.4840.6160.5861.000
2023-12-10T15:35:43.022423image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
"기본키값""차례값""건수값""해당일시"
"기본키값"1.0000.025-0.1340.961
"차례값"0.0251.000-0.6060.000
"건수값"-0.134-0.6061.0000.216
"해당일시"0.9610.0000.2161.000

Missing values

2023-12-10T15:35:39.099980image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T15:35:39.277395image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

"기본키값""해당일시""차례값""이슈어값""건수값"
0647112020-10-01 00:00:001카누25
1647122020-10-01 00:00:002아이패드15
2647132020-10-01 00:00:003레고6
3647142020-10-01 00:00:004투썸플레이스6
4647152020-10-01 00:00:005리디북스5
5647162020-10-01 00:00:006폴라로이드5
6647172020-10-01 00:00:007카카오톡4
7647182020-10-01 00:00:008갤럭시4
8647192020-10-01 00:00:009빌리프3
9647202020-10-01 00:00:0010하겐다즈3
"기본키값""해당일시""차례값""이슈어값""건수값"
390651012020-10-02 15:00:001레고9
391651022020-10-02 15:00:002애플와치6
392651032020-10-02 15:00:003미샤6
393651042020-10-02 15:00:004배달의민족5
394651052020-10-02 15:00:005셀린느3
395651062020-10-02 15:00:006코카콜라3
396651072020-10-02 15:00:007기아자동차2
397651082020-10-02 15:00:008백설2
398651092020-10-02 15:00:009샤오미2
399651102020-10-02 15:00:0010하이네켄2