Overview

Dataset statistics

Number of variables: 6
Number of observations: 400
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 20.1 KiB
Average record size in memory: 51.3 B

Variable types

Numeric: 3
Categorical: 2
Text: 1

Dataset

Description: Sample
Author: 코난테크놀로지
URL: https://www.bigdata-telecom.kr/invoke/SOKBP2603/?goodsCode=TPOCOLOR

Alerts

"채널값" has constant value "블로그" (Constant)
"기본키값" is highly overall correlated with "해당일자" (High correlation)
"차례값" is highly overall correlated with "건수값" (High correlation)
"건수값" is highly overall correlated with "차례값" (High correlation)
"해당일자" is highly overall correlated with "기본키값" (High correlation)
"기본키값" has unique values (Unique)

Reproduction

Analysis started: 2023-12-10 06:39:16.993835
Analysis finished: 2023-12-10 06:39:19.116367
Duration: 2.12 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

"기본키값"
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 400
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 40601.59
Minimum: 15802
Maximum: 65155
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 3.6 KiB

Quantile statistics

Minimum: 15802
5-th percentile: 15821.95
Q1: 31711.75
Median: 47693.5
Q3: 65055.25
95-th percentile: 65135.05
Maximum: 65155
Range: 49353
Interquartile range (IQR): 33343.5

Descriptive statistics

Standard deviation: 18298.247
Coefficient of variation (CV): 0.45067808
Kurtosis: -1.3425882
Mean: 40601.59
Median Absolute Deviation (MAD): 15985
Skewness: 0.011495093
Sum: 16240636
Variance: 3.3482583 × 10^8
Monotonicity: Strictly increasing
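Several of the figures above are derived from one another, so they can be cross-checked. The following is a minimal sketch that recomputes the mean, range, IQR, coefficient of variation, and variance for "기본키값" from the other reported values. The variable names are illustrative, and because the inputs are the rounded values printed in the report, the results agree only up to rounding.

```python
import math

# Values copied from the report for "기본키값" (already rounded there).
n = 400
total = 16240636
minimum, maximum = 15802, 65155
q1, q3 = 31711.75, 65055.25
std = 18298.247

mean = total / n                 # Mean = Sum / number of observations
value_range = maximum - minimum  # Range = Maximum - Minimum
iqr = q3 - q1                    # Interquartile range = Q3 - Q1
cv = std / mean                  # Coefficient of variation = std / mean
variance = std ** 2              # Variance = squared standard deviation

print(f"mean={mean}, range={value_range}, iqr={iqr}")
print(f"cv={cv:.6f}, variance={variance:.6e}")
```

Comparing with a tolerance (for example `math.isclose(cv, 0.45067808, rel_tol=1e-6)`) is safer than exact equality, since the standard deviation is itself a rounded input.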
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
15802 | 1 | 0.2%
47758 | 1 | 0.2%
47768 | 1 | 0.2%
47767 | 1 | 0.2%
47766 | 1 | 0.2%
47765 | 1 | 0.2%
47764 | 1 | 0.2%
47763 | 1 | 0.2%
47762 | 1 | 0.2%
47761 | 1 | 0.2%
Other values (390) | 390 | 97.5%

Value | Count | Frequency (%)
15802 | 1 | 0.2%
15803 | 1 | 0.2%
15804 | 1 | 0.2%
15805 | 1 | 0.2%
15806 | 1 | 0.2%
15807 | 1 | 0.2%
15808 | 1 | 0.2%
15809 | 1 | 0.2%
15810 | 1 | 0.2%
15811 | 1 | 0.2%

Value | Count | Frequency (%)
65155 | 1 | 0.2%
65154 | 1 | 0.2%
65153 | 1 | 0.2%
65152 | 1 | 0.2%
65151 | 1 | 0.2%
65150 | 1 | 0.2%
65149 | 1 | 0.2%
65148 | 1 | 0.2%
65147 | 1 | 0.2%
65146 | 1 | 0.2%

"채널값"
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Memory size: 3.3 KiB
"블로그": 400

Length

Max length: 5
Median length: 5
Mean length: 5
Min length: 5

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: "블로그"
2nd row: "블로그"
3rd row: "블로그"
4th row: "블로그"
5th row: "블로그"

Common Values

Value | Count | Frequency (%)
"블로그" | 400 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
블로그 | 400 | 100.0%

"해당일자"
Categorical

HIGH CORRELATION 

Distinct: 4
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 3.3 KiB
2020-05-04: 103
2020-05-03: 102
2020-05-02: 99
2020-05-01: 96

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 2020-05-01
2nd row: 2020-05-01
3rd row: 2020-05-01
4th row: 2020-05-01
5th row: 2020-05-01

Common Values

Value | Count | Frequency (%)
2020-05-04 | 103 | 25.8%
2020-05-03 | 102 | 25.5%
2020-05-02 | 99 | 24.8%
2020-05-01 | 96 | 24.0%
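The frequency column follows directly from the counts. As a small illustration (the dictionary and variable names are hypothetical, the counts are the ones listed above), each share is the count divided by the 400 observations, and the report then shows it to one decimal place:

```python
# Per-day row counts for "해당일자", copied from the Common Values table.
counts = {
    "2020-05-04": 103,
    "2020-05-03": 102,
    "2020-05-02": 99,
    "2020-05-01": 96,
}

n = sum(counts.values())  # total number of observations

# Share of each date as a percentage of all rows.
shares = {day: 100 * count / n for day, count in counts.items()}

# "Distinct (%)" is the number of distinct values over the number of rows.
distinct_pct = 100 * len(counts) / n

for day, share in shares.items():
    print(f"{day}: {share:.2f}%")
print(f"distinct: {distinct_pct:.1f}%")
```

The unrounded shares are 25.75%, 25.50%, 24.75%, and 24.00%, which is why the one-decimal figures in the table do not sum to exactly 100.0%.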

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
2020-05-04 | 103 | 25.8%
2020-05-03 | 102 | 25.5%
2020-05-02 | 99 | 24.8%
2020-05-01 | 96 | 24.0%

"차례값"
Real number (ℝ)

HIGH CORRELATION 

Distinct: 103
Distinct (%): 25.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 50.5375
Minimum: 1
Maximum: 103
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 3.6 KiB

Quantile statistics

Minimum: 1
5-th percentile: 5.95
Q1: 25.75
Median: 50.5
Q3: 75.25
95-th percentile: 95.05
Maximum: 103
Range: 102
Interquartile range (IQR): 49.5

Descriptive statistics

Standard deviation: 28.966727
Coefficient of variation (CV): 0.57317293
Kurtosis: -1.1898357
Mean: 50.5375
Median Absolute Deviation (MAD): 25
Skewness: 0.0076089644
Sum: 20215
Variance: 839.07127
Monotonicity: Not monotonic
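These statistics can be reproduced without the raw file under one assumption: that "차례값" is a rank that restarts at 1 for each date and increases by one per row. That reading is suggested by the sample rows (1 to 10 on 2020-05-01, 94 to 103 on 2020-05-04) and by the per-day row counts of 96, 99, 102, and 103, but it is an assumption, not something the report states. The sketch below rebuilds the column that way and recomputes the summary values; `percentile` is a hypothetical helper using linear interpolation between neighbouring order statistics.

```python
import statistics

# Assumption: each day contributes the ranks 1..size, with the day sizes
# taken from the "해당일자" counts in this report.
day_sizes = [96, 99, 102, 103]
values = sorted(rank for size in day_sizes for rank in range(1, size + 1))

def percentile(sorted_values, p):
    """Percentile p (0-100) with linear interpolation between order statistics."""
    pos = (len(sorted_values) - 1) * p / 100
    lower = int(pos)
    upper = min(lower + 1, len(sorted_values) - 1)
    frac = pos - lower
    return sorted_values[lower] + frac * (sorted_values[upper] - sorted_values[lower])

mean = statistics.fmean(values)
median = statistics.median(values)
variance = statistics.variance(values)  # sample variance (divides by n - 1)
std = statistics.stdev(values)
q1, q3 = percentile(values, 25), percentile(values, 75)
mad = statistics.median(abs(v - median) for v in values)

print(len(values), sum(values), mean, median, q1, q3, mad)
print(round(variance, 5), round(std, 6))
```

Under this assumption the reconstructed column matches the reported count, sum, mean, quartiles, MAD, and variance, which also indicates that the report's variance is the sample variance.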
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
1 | 4 | 1.0%
50 | 4 | 1.0%
72 | 4 | 1.0%
71 | 4 | 1.0%
70 | 4 | 1.0%
69 | 4 | 1.0%
68 | 4 | 1.0%
67 | 4 | 1.0%
66 | 4 | 1.0%
65 | 4 | 1.0%
Other values (93) | 360 | 90.0%

Value | Count | Frequency (%)
1 | 4 | 1.0%
2 | 4 | 1.0%
3 | 4 | 1.0%
4 | 4 | 1.0%
5 | 4 | 1.0%
6 | 4 | 1.0%
7 | 4 | 1.0%
8 | 4 | 1.0%
9 | 4 | 1.0%
10 | 4 | 1.0%

Value | Count | Frequency (%)
103 | 1 | 0.2%
102 | 2 | 0.5%
101 | 2 | 0.5%
100 | 2 | 0.5%
99 | 3 | 0.8%
98 | 3 | 0.8%
97 | 3 | 0.8%
96 | 4 | 1.0%
95 | 4 | 1.0%
94 | 4 | 1.0%

"이슈어값"
Text

Distinct: 108
Distinct (%): 27.0%
Missing: 0
Missing (%): 0.0%
Memory size: 3.3 KiB

Length

Max length: 9
Median length: 8
Mean length: 5.355
Min length: 4

Characters and Unicode

Total characters: 2142
Distinct characters: 142
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 3
Unique (%): 0.8%

Sample

1st row: "검정색"
2nd row: "파랑"
3rd row: "빨강"
4th row: "흰색"
5th row: "초록"

Value | Count | Frequency (%)
검정색 | 4 | 1.0%
울트라바이올렛 | 4 | 1.0%
쑥색 | 4 | 1.0%
에머랄드색 | 4 | 1.0%
스틸그레이 | 4 | 1.0%
초콜렛색 | 4 | 1.0%
체리토마토 | 4 | 1.0%
군청색 | 4 | 1.0%
회갈색 | 4 | 1.0%
와인레드 | 4 | 1.0%
Other values (98) | 360 | 90.0%

Most occurring characters

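Value | Count | Frequency (%)
" | 800 | 37.3%
(not preserved) | 296 | 13.8%
(not preserved) | 52 | 2.4%
(not preserved) | 35 | 1.6%
(not preserved) | 27 | 1.3%
(not preserved) | 22 | 1.0%
(not preserved) | 20 | 0.9%
(not preserved) | 19 | 0.9%
(not preserved) | 19 | 0.9%
(not preserved) | 19 | 0.9%
Other values (132) | 833 | 38.9%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1342 | 62.7%
Other Punctuation | 800 | 37.3%

Most frequent character per category

Other Letter
Value | Count | Frequency (%)
(not preserved) | 296 | 22.1%
(not preserved) | 52 | 3.9%
(not preserved) | 35 | 2.6%
(not preserved) | 27 | 2.0%
(not preserved) | 22 | 1.6%
(not preserved) | 20 | 1.5%
(not preserved) | 19 | 1.4%
(not preserved) | 19 | 1.4%
(not preserved) | 19 | 1.4%
(not preserved) | 19 | 1.4%
Other values (131) | 814 | 60.7%
Other Punctuation
Value | Count | Frequency (%)
" | 800 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1342 | 62.7%
Common | 800 | 37.3%

Most frequent character per script

Hangul
Value | Count | Frequency (%)
(not preserved) | 296 | 22.1%
(not preserved) | 52 | 3.9%
(not preserved) | 35 | 2.6%
(not preserved) | 27 | 2.0%
(not preserved) | 22 | 1.6%
(not preserved) | 20 | 1.5%
(not preserved) | 19 | 1.4%
(not preserved) | 19 | 1.4%
(not preserved) | 19 | 1.4%
(not preserved) | 19 | 1.4%
Other values (131) | 814 | 60.7%
Common
Value | Count | Frequency (%)
" | 800 | 100.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1342 | 62.7%
ASCII | 800 | 37.3%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
" | 800 | 100.0%
Hangul
Value | Count | Frequency (%)
(not preserved) | 296 | 22.1%
(not preserved) | 52 | 3.9%
(not preserved) | 35 | 2.6%
(not preserved) | 27 | 2.0%
(not preserved) | 22 | 1.6%
(not preserved) | 20 | 1.5%
(not preserved) | 19 | 1.4%
(not preserved) | 19 | 1.4%
(not preserved) | 19 | 1.4%
(not preserved) | 19 | 1.4%
Other values (131) | 814 | 60.7%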

"건수값"
Real number (ℝ)

HIGH CORRELATION 

Distinct: 167
Distinct (%): 41.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 683.6925
Minimum: 1
Maximum: 23742
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 3.6 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 5
Median: 21
Q3: 94.75
95-th percentile: 2583.2
Maximum: 23742
Range: 23741
Interquartile range (IQR): 89.75

Descriptive statistics

Standard deviation: 2595.2291
Coefficient of variation (CV): 3.7959011
Kurtosis: 44.072311
Mean: 683.6925
Median Absolute Deviation (MAD): 19
Skewness: 6.2117494
Sum: 273477
Variance: 6735214
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Value | Count | Frequency (%)
1 | 31 | 7.8%
3 | 23 | 5.8%
2 | 21 | 5.2%
5 | 17 | 4.2%
4 | 15 | 3.8%
6 | 10 | 2.5%
18 | 9 | 2.2%
7 | 9 | 2.2%
8 | 9 | 2.2%
12 | 7 | 1.8%
Other values (157) | 249 | 62.3%

Value | Count | Frequency (%)
1 | 31 | 7.8%
2 | 21 | 5.2%
3 | 23 | 5.8%
4 | 15 | 3.8%
5 | 17 | 4.2%
6 | 10 | 2.5%
7 | 9 | 2.2%
8 | 9 | 2.2%
9 | 5 | 1.2%
10 | 6 | 1.5%

Value | Count | Frequency (%)
23742 | 1 | 0.2%
22558 | 1 | 0.2%
20511 | 1 | 0.2%
19361 | 1 | 0.2%
11702 | 1 | 0.2%
9945 | 1 | 0.2%
9147 | 1 | 0.2%
8943 | 1 | 0.2%
8785 | 1 | 0.2%
8686 | 1 | 0.2%

Interactions

[Figures: 9 interaction plots, not preserved in this text export]

Correlations

Variable | "기본키값" | "해당일자" | "차례값" | "건수값"
"기본키값" | 1.000 | 1.000 | 0.000 | 0.000
"해당일자" | 1.000 | 1.000 | 0.000 | 0.000
"차례값" | 0.000 | 0.000 | 1.000 | 0.438
"건수값" | 0.000 | 0.000 | 0.438 | 1.000

Variable | "기본키값" | "차례값" | "건수값" | "해당일자"
"기본키값" | 1.000 | 0.294 | -0.292 | 1.000
"차례값" | 0.294 | 1.000 | -0.998 | 0.000
"건수값" | -0.292 | -0.998 | 1.000 | 0.000
"해당일자" | 1.000 | 0.000 | 0.000 | 1.000

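The strongly negative value between "차례값" and "건수값" (-0.998 in the second matrix) is what a rank correlation gives when one column falls almost perfectly as the other rises. The text export does not say which coefficient each matrix uses, so the following is only a sketch that assumes Spearman's rank correlation and applies it to the first ten sample rows of this report. The helper names `ranks`, `pearson`, and `spearman` are illustrative and not part of ydata-profiling.

```python
import math

def ranks(values):
    """Rank values from 1 (smallest) upward; assumes there are no ties."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0] * len(values)
    for rank, idx in enumerate(order, start=1):
        result[idx] = rank
    return result

def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def spearman(x, y):
    """Spearman correlation: Pearson correlation computed on the ranks."""
    return pearson(ranks(x), ranks(y))

# First ten rows of the report's sample ("차례값" and "건수값" columns).
rank_col = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
count_col = [23742, 11702, 8943, 7341, 4025, 2533, 2216, 1688, 1687, 1660]

rho = spearman(rank_col, count_col)
print(f"Spearman rho on the 10 sample rows: {rho:.3f}")
```

On these ten rows the counts decrease strictly with the rank, so the coefficient is -1 up to floating-point error. The full column contains ties (for example many rows with a count of 1), which this simplified `ranks` helper does not handle.
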
Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

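Sample

Row | "기본키값" | "채널값" | "해당일자" | "차례값" | "이슈어값" | "건수값"
0 | 15802 | "블로그" | 2020-05-01 | 1 | "검정색" | 23742
1 | 15803 | "블로그" | 2020-05-01 | 2 | "파랑" | 11702
2 | 15804 | "블로그" | 2020-05-01 | 3 | "빨강" | 8943
3 | 15805 | "블로그" | 2020-05-01 | 4 | "흰색" | 7341
4 | 15806 | "블로그" | 2020-05-01 | 5 | "초록" | 4025
5 | 15807 | "블로그" | 2020-05-01 | 6 | "베이지" | 2533
6 | 15808 | "블로그" | 2020-05-01 | 7 | "노랑색" | 2216
7 | 15809 | "블로그" | 2020-05-01 | 8 | "분홍색" | 1688
8 | 15810 | "블로그" | 2020-05-01 | 9 | "회색" | 1687
9 | 15811 | "블로그" | 2020-05-01 | 10 | "갈색" | 1660

Row | "기본키값" | "채널값" | "해당일자" | "차례값" | "이슈어값" | "건수값"
390 | 65146 | "블로그" | 2020-05-04 | 94 | "딸기색" | 2
391 | 65147 | "블로그" | 2020-05-04 | 95 | "슬레이트그레이" | 2
392 | 65148 | "블로그" | 2020-05-04 | 96 | "라피스블루" | 1
393 | 65149 | "블로그" | 2020-05-04 | 97 | "포도색" | 1
394 | 65150 | "블로그" | 2020-05-04 | 98 | "헤이즐넛색" | 1
395 | 65151 | "블로그" | 2020-05-04 | 99 | "금갈색" | 1
396 | 65152 | "블로그" | 2020-05-04 | 100 | "코르크색" | 1
397 | 65153 | "블로그" | 2020-05-04 | 101 | "스노화이트" | 1
398 | 65154 | "블로그" | 2020-05-04 | 102 | "샐먼핑크" | 1
399 | 65155 | "블로그" | 2020-05-04 | 103 | "진남색" | 1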