Overview

Dataset statistics

Number of variables6
Number of observations400
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory20.1 KiB
Average record size in memory51.3 B

Variable types

Numeric3
Categorical1
DateTime1
Text1

Dataset

DescriptionSample
Author코난테크놀로지
URLhttps://www.bigdata-telecom.kr/invoke/SOKBP2603/?goodsCode=TPOOCCASION

Alerts

"채널값" has constant value ""Constant
"차례값" is highly overall correlated with "건수값"High correlation
"건수값" is highly overall correlated with "차례값"High correlation
"기본키값" has unique valuesUnique

Reproduction

Analysis started2023-12-10 06:22:46.887983
Analysis finished2023-12-10 06:22:49.019567
Duration2.13 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

"기본키값"
Real number (ℝ)

UNIQUE 

Distinct400
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean20254.5
Minimum16467
Maximum32466
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.6 KiB
2023-12-10T15:22:49.145725image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum16467
5-th percentile16486.95
Q116566.75
median16666.5
Q316766.25
95-th percentile32446.05
Maximum32466
Range15999
Interquartile range (IQR)199.5

Descriptive statistics

Standard deviation6657.9512
Coefficient of variation (CV)0.32871467
Kurtosis-0.3432128
Mean20254.5
Median Absolute Deviation (MAD)100
Skewness1.2875265
Sum8101800
Variance44328314
MonotonicityStrictly increasing
2023-12-10T15:22:49.405967image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
16467 1
 
0.2%
16731 1
 
0.2%
16741 1
 
0.2%
16740 1
 
0.2%
16739 1
 
0.2%
16738 1
 
0.2%
16737 1
 
0.2%
16736 1
 
0.2%
16735 1
 
0.2%
16734 1
 
0.2%
Other values (390) 390
97.5%
ValueCountFrequency (%)
16467 1
0.2%
16468 1
0.2%
16469 1
0.2%
16470 1
0.2%
16471 1
0.2%
16472 1
0.2%
16473 1
0.2%
16474 1
0.2%
16475 1
0.2%
16476 1
0.2%
ValueCountFrequency (%)
32466 1
0.2%
32465 1
0.2%
32464 1
0.2%
32463 1
0.2%
32462 1
0.2%
32461 1
0.2%
32460 1
0.2%
32459 1
0.2%
32458 1
0.2%
32457 1
0.2%

"채널값"
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
"블로그"
400 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row"블로그"
2nd row"블로그"
3rd row"블로그"
4th row"블로그"
5th row"블로그"

Common Values

ValueCountFrequency (%)
"블로그" 400
100.0%

Length

2023-12-10T15:22:49.650527image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T15:22:50.236796image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
블로그 400
100.0%
Distinct2
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
Minimum2020-05-01 00:00:00
Maximum2020-05-02 00:00:00
2023-12-10T15:22:50.399009image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:22:50.588235image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=2)

"차례값"
Real number (ℝ)

HIGH CORRELATION 

Distinct308
Distinct (%)77.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean129.66
Minimum1
Maximum308
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.6 KiB
2023-12-10T15:22:50.809844image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile10.95
Q150.75
median108.5
Q3208.25
95-th percentile288.05
Maximum308
Range307
Interquartile range (IQR)157.5

Descriptive statistics

Standard deviation91.300514
Coefficient of variation (CV)0.70415328
Kurtosis-1.1570759
Mean129.66
Median Absolute Deviation (MAD)72
Skewness0.38745295
Sum51864
Variance8335.7839
MonotonicityNot monotonic
2023-12-10T15:22:51.046880image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 2
 
0.5%
60 2
 
0.5%
69 2
 
0.5%
68 2
 
0.5%
67 2
 
0.5%
66 2
 
0.5%
65 2
 
0.5%
64 2
 
0.5%
63 2
 
0.5%
62 2
 
0.5%
Other values (298) 380
95.0%
ValueCountFrequency (%)
1 2
0.5%
2 2
0.5%
3 2
0.5%
4 2
0.5%
5 2
0.5%
6 2
0.5%
7 2
0.5%
8 2
0.5%
9 2
0.5%
10 2
0.5%
ValueCountFrequency (%)
308 1
0.2%
307 1
0.2%
306 1
0.2%
305 1
0.2%
304 1
0.2%
303 1
0.2%
302 1
0.2%
301 1
0.2%
300 1
0.2%
299 1
0.2%
Distinct308
Distinct (%)77.0%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
2023-12-10T15:22:51.673260image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length7
Mean length4.9425
Min length4

Characters and Unicode

Total characters1977
Distinct characters280
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique216 ?
Unique (%)54.0%

Sample

1st row"여행"
2nd row"쇼핑"
3rd row"공부"
4th row"약속"
5th row"운전"
ValueCountFrequency (%)
여행 2
 
0.5%
이별 2
 
0.5%
야식 2
 
0.5%
낚시 2
 
0.5%
해외여행 2
 
0.5%
선거 2
 
0.5%
설거지 2
 
0.5%
투표 2
 
0.5%
고백 2
 
0.5%
개학 2
 
0.5%
Other values (298) 380
95.0%
2023-12-10T15:22:52.532887image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
" 800
40.5%
65
 
3.3%
63
 
3.2%
26
 
1.3%
19
 
1.0%
19
 
1.0%
18
 
0.9%
1 18
 
0.9%
2 17
 
0.9%
16
 
0.8%
Other values (270) 916
46.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 1059
53.6%
Other Punctuation 800
40.5%
Decimal Number 118
 
6.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
65
 
6.1%
63
 
5.9%
26
 
2.5%
19
 
1.8%
19
 
1.8%
18
 
1.7%
16
 
1.5%
16
 
1.5%
16
 
1.5%
14
 
1.3%
Other values (259) 787
74.3%
Decimal Number
ValueCountFrequency (%)
1 18
15.3%
2 17
14.4%
3 13
11.0%
5 12
10.2%
6 12
10.2%
0 11
9.3%
9 9
7.6%
4 9
7.6%
8 9
7.6%
7 8
6.8%
Other Punctuation
ValueCountFrequency (%)
" 800
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 1059
53.6%
Common 918
46.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
65
 
6.1%
63
 
5.9%
26
 
2.5%
19
 
1.8%
19
 
1.8%
18
 
1.7%
16
 
1.5%
16
 
1.5%
16
 
1.5%
14
 
1.3%
Other values (259) 787
74.3%
Common
ValueCountFrequency (%)
" 800
87.1%
1 18
 
2.0%
2 17
 
1.9%
3 13
 
1.4%
5 12
 
1.3%
6 12
 
1.3%
0 11
 
1.2%
9 9
 
1.0%
4 9
 
1.0%
8 9
 
1.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 1059
53.6%
ASCII 918
46.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
" 800
87.1%
1 18
 
2.0%
2 17
 
1.9%
3 13
 
1.4%
5 12
 
1.3%
6 12
 
1.3%
0 11
 
1.2%
9 9
 
1.0%
4 9
 
1.0%
8 9
 
1.0%
Hangul
ValueCountFrequency (%)
65
 
6.1%
63
 
5.9%
26
 
2.5%
19
 
1.8%
19
 
1.8%
18
 
1.7%
16
 
1.5%
16
 
1.5%
16
 
1.5%
14
 
1.3%
Other values (259) 787
74.3%

"건수값"
Real number (ℝ)

HIGH CORRELATION 

Distinct315
Distinct (%)78.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1672.3375
Minimum1
Maximum21147
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.6 KiB
2023-12-10T15:22:52.802997image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q146.75
median489
Q31686.5
95-th percentile7014.65
Maximum21147
Range21146
Interquartile range (IQR)1639.75

Descriptive statistics

Standard deviation3184.7658
Coefficient of variation (CV)1.9043798
Kurtosis12.22317
Mean1672.3375
Median Absolute Deviation (MAD)476.5
Skewness3.3124698
Sum668935
Variance10142733
MonotonicityNot monotonic
2023-12-10T15:22:53.034164image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 16
 
4.0%
4 11
 
2.8%
2 7
 
1.8%
6 6
 
1.5%
17 4
 
1.0%
8 4
 
1.0%
14 4
 
1.0%
47 3
 
0.8%
5 3
 
0.8%
59 3
 
0.8%
Other values (305) 339
84.8%
ValueCountFrequency (%)
1 16
4.0%
2 7
1.8%
3 3
 
0.8%
4 11
2.8%
5 3
 
0.8%
6 6
 
1.5%
7 2
 
0.5%
8 4
 
1.0%
9 3
 
0.8%
10 2
 
0.5%
ValueCountFrequency (%)
21147 1
0.2%
19147 1
0.2%
18027 1
0.2%
15816 1
0.2%
15312 1
0.2%
15066 1
0.2%
15063 1
0.2%
14708 1
0.2%
14083 1
0.2%
13706 1
0.2%

Interactions

2023-12-10T15:22:48.348979image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:22:47.306791image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:22:47.870395image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:22:48.479054image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:22:47.533730image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:22:48.048628image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:22:48.611964image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:22:47.715210image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:22:48.218839image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T15:22:53.180832image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
"기본키값""해당일자""차례값""건수값"
"기본키값"1.0001.0000.7380.407
"해당일자"1.0001.0000.7350.394
"차례값"0.7380.7351.0000.728
"건수값"0.4070.3940.7281.000
2023-12-10T15:22:53.333535image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
"기본키값""차례값""건수값"
"기본키값"1.0000.206-0.231
"차례값"0.2061.000-1.000
"건수값"-0.231-1.0001.000

Missing values

2023-12-10T15:22:48.773115image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T15:22:48.946836image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

"기본키값""채널값""해당일자""차례값""이슈어값""건수값"
016467"블로그"2020-05-011"여행"21147
116468"블로그"2020-05-012"쇼핑"18027
216469"블로그"2020-05-013"공부"15816
316470"블로그"2020-05-014"약속"15312
416471"블로그"2020-05-015"운전"15066
516472"블로그"2020-05-016"식사"14708
616473"블로그"2020-05-017"운동"13588
716474"블로그"2020-05-018"요리"11842
816475"블로그"2020-05-019"치료"9838
916476"블로그"2020-05-0110"결혼"8072
"기본키값""채널값""해당일자""차례값""이슈어값""건수값"
39032457"블로그"2020-05-0283"직장생활"690
39132458"블로그"2020-05-0284"토론"654
39232459"블로그"2020-05-0285"가족모임"639
39332460"블로그"2020-05-0286"승진"631
39432461"블로그"2020-05-0287"후원"602
39532462"블로그"2020-05-0288"스터디"589
39632463"블로그"2020-05-0289"호캉스"555
39732464"블로그"2020-05-0290"여가"545
39832465"블로그"2020-05-0291"신혼"536
39932466"블로그"2020-05-0292"돌잔치"533