Overview

Dataset statistics

Number of variables6
Number of observations400
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory20.1 KiB
Average record size in memory51.3 B

Variable types

Numeric3
Categorical3

Dataset

DescriptionSample
Author코난테크놀로지
URLhttps://www.bigdata-telecom.kr/invoke/SOKBP2603/?goodsCode=TPOAGE

Alerts

"채널값" has constant value ""Constant
"기본키값" is highly overall correlated with "해당일자"High correlation
"차례값" is highly overall correlated with "건수값" and 1 other fieldsHigh correlation
"건수값" is highly overall correlated with "차례값"High correlation
"해당일자" is highly overall correlated with "기본키값"High correlation
"이슈어값" is highly overall correlated with "차례값"High correlation
"기본키값" has unique valuesUnique

Reproduction

Analysis started2023-12-10 06:31:49.311060
Analysis finished2023-12-10 06:31:51.508952
Duration2.2 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

"기본키값"
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct400
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean149287.21
Minimum15898
Maximum296707
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.6 KiB
2023-12-10T15:31:51.614723image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum15898
5-th percentile15917.95
Q181699.75
median149075.5
Q3230078.25
95-th percentile279133.05
Maximum296707
Range280809
Interquartile range (IQR)148378.5

Descriptive statistics

Standard deviation84146.808
Coefficient of variation (CV)0.56365716
Kurtosis-1.2284052
Mean149287.21
Median Absolute Deviation (MAD)80998
Skewness0.072188367
Sum59714886
Variance7.0806852 × 109
MonotonicityStrictly increasing
2023-12-10T15:31:51.824672image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
15898 1
 
0.2%
182718 1
 
0.2%
212756 1
 
0.2%
212755 1
 
0.2%
212754 1
 
0.2%
212753 1
 
0.2%
212752 1
 
0.2%
212751 1
 
0.2%
212750 1
 
0.2%
182721 1
 
0.2%
Other values (390) 390
97.5%
ValueCountFrequency (%)
15898 1
0.2%
15899 1
0.2%
15900 1
0.2%
15901 1
0.2%
15902 1
0.2%
15903 1
0.2%
15904 1
0.2%
15905 1
0.2%
15906 1
0.2%
15907 1
0.2%
ValueCountFrequency (%)
296707 1
0.2%
296706 1
0.2%
296705 1
0.2%
296704 1
0.2%
296703 1
0.2%
296702 1
0.2%
296701 1
0.2%
296700 1
0.2%
279145 1
0.2%
279144 1
0.2%

"채널값"
Categorical

CONSTANT 

Distinct1
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
"블로그"
400 

Length

Max length5
Median length5
Mean length5
Min length5

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row"블로그"
2nd row"블로그"
3rd row"블로그"
4th row"블로그"
5th row"블로그"

Common Values

ValueCountFrequency (%)
"블로그" 400
100.0%

Length

2023-12-10T15:31:52.051366image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T15:31:52.212936image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
블로그 400
100.0%

"해당일자"
Categorical

HIGH CORRELATION 

Distinct17
Distinct (%)4.2%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
2020-05-16
 
26
2020-05-07
 
26
2020-05-06
 
25
2020-05-14
 
25
2020-05-08
 
25
Other values (12)
273 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2020-05-01
2nd row2020-05-01
3rd row2020-05-01
4th row2020-05-01
5th row2020-05-01

Common Values

ValueCountFrequency (%)
2020-05-16 26
 
6.5%
2020-05-07 26
 
6.5%
2020-05-06 25
 
6.2%
2020-05-14 25
 
6.2%
2020-05-08 25
 
6.2%
2020-05-11 25
 
6.2%
2020-05-04 25
 
6.2%
2020-05-17 25
 
6.2%
2020-05-09 25
 
6.2%
2020-05-13 24
 
6.0%
Other values (7) 149
37.2%

Length

2023-12-10T15:31:52.379105image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
2020-05-16 26
 
6.5%
2020-05-07 26
 
6.5%
2020-05-11 25
 
6.2%
2020-05-17 25
 
6.2%
2020-05-04 25
 
6.2%
2020-05-09 25
 
6.2%
2020-05-08 25
 
6.2%
2020-05-14 25
 
6.2%
2020-05-06 25
 
6.2%
2020-05-13 24
 
6.0%
Other values (7) 149
37.2%

"차례값"
Real number (ℝ)

HIGH CORRELATION 

Distinct26
Distinct (%)6.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean12.6025
Minimum1
Maximum26
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.6 KiB
2023-12-10T15:31:52.575283image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q16
median12.5
Q319
95-th percentile24
Maximum26
Range25
Interquartile range (IQR)13

Descriptive statistics

Standard deviation7.1370047
Coefficient of variation (CV)0.56631658
Kurtosis-1.1964981
Mean12.6025
Median Absolute Deviation (MAD)6.5
Skewness0.0456069
Sum5041
Variance50.936836
MonotonicityNot monotonic
2023-12-10T15:31:52.775763image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=26)
ValueCountFrequency (%)
1 17
 
4.2%
3 17
 
4.2%
4 17
 
4.2%
5 17
 
4.2%
6 17
 
4.2%
7 17
 
4.2%
8 17
 
4.2%
2 17
 
4.2%
17 16
 
4.0%
23 16
 
4.0%
Other values (16) 232
58.0%
ValueCountFrequency (%)
1 17
4.2%
2 17
4.2%
3 17
4.2%
4 17
4.2%
5 17
4.2%
6 17
4.2%
7 17
4.2%
8 17
4.2%
9 16
4.0%
10 16
4.0%
ValueCountFrequency (%)
26 2
 
0.5%
25 9
2.2%
24 13
3.2%
23 16
4.0%
22 16
4.0%
21 16
4.0%
20 16
4.0%
19 16
4.0%
18 16
4.0%
17 16
4.0%

"이슈어값"
Categorical

HIGH CORRELATION 

Distinct27
Distinct (%)6.8%
Missing0
Missing (%)0.0%
Memory size3.3 KiB
"어린이"
 
17
"20대(20~29세)"
 
17
"고등학생"
 
17
"10대(10~19세)"
 
17
"신생아"
 
17
Other values (22)
315 

Length

Max length13
Median length8
Mean length8.865
Min length5

Unique

Unique1 ?
Unique (%)0.2%

Sample

1st row"어린이"
2nd row"초등학생"
3rd row"청소년"
4th row"20대(20~29세)"
5th row"고등학생"

Common Values

ValueCountFrequency (%)
"어린이" 17
 
4.2%
"20대(20~29세)" 17
 
4.2%
"고등학생" 17
 
4.2%
"10대(10~19세)" 17
 
4.2%
"신생아" 17
 
4.2%
"중학생" 17
 
4.2%
"초등학생" 17
 
4.2%
"청소년" 17
 
4.2%
"70대(70~79세)" 16
 
4.0%
"30대(30~39세)" 16
 
4.0%
Other values (17) 232
58.0%

Length

2023-12-10T15:31:53.011619image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
어린이 17
 
4.2%
고등학생 17
 
4.2%
10대(10~19세 17
 
4.2%
신생아 17
 
4.2%
중학생 17
 
4.2%
초등학생 17
 
4.2%
청소년 17
 
4.2%
20대(20~29세 17
 
4.2%
50대(50~59세 16
 
4.0%
유치원생 16
 
4.0%
Other values (17) 232
58.0%

"건수값"
Real number (ℝ)

HIGH CORRELATION 

Distinct302
Distinct (%)75.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean804.3925
Minimum1
Maximum8975
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.6 KiB
2023-12-10T15:31:53.258204image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile2
Q146
median333.5
Q31145.75
95-th percentile2444.4
Maximum8975
Range8974
Interquartile range (IQR)1099.75

Descriptive statistics

Standard deviation1212.4674
Coefficient of variation (CV)1.5073082
Kurtosis12.246008
Mean804.3925
Median Absolute Deviation (MAD)327
Skewness3.1002726
Sum321757
Variance1470077.2
MonotonicityNot monotonic
2023-12-10T15:31:53.478554image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 17
 
4.2%
6 10
 
2.5%
3 9
 
2.2%
2 6
 
1.5%
5 6
 
1.5%
8 6
 
1.5%
7 6
 
1.5%
11 5
 
1.2%
18 5
 
1.2%
10 4
 
1.0%
Other values (292) 326
81.5%
ValueCountFrequency (%)
1 17
4.2%
2 6
 
1.5%
3 9
2.2%
4 2
 
0.5%
5 6
 
1.5%
6 10
2.5%
7 6
 
1.5%
8 6
 
1.5%
9 4
 
1.0%
10 4
 
1.0%
ValueCountFrequency (%)
8975 1
0.2%
7428 1
0.2%
7070 1
0.2%
5862 1
0.2%
5704 1
0.2%
5677 1
0.2%
5490 1
0.2%
5393 1
0.2%
5267 1
0.2%
5241 1
0.2%

Interactions

2023-12-10T15:31:50.719954image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:31:49.762670image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:31:50.250590image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:31:50.872382image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:31:49.958944image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:31:50.421631image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:31:51.020325image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:31:50.106147image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T15:31:50.580844image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2023-12-10T15:31:53.632996image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
"기본키값""해당일자""차례값""이슈어값""건수값"
"기본키값"1.0001.0000.0000.0000.000
"해당일자"1.0001.0000.0000.0000.000
"차례값"0.0000.0001.0000.9890.684
"이슈어값"0.0000.0000.9891.0000.861
"건수값"0.0000.0000.6840.8611.000
2023-12-10T15:31:53.808421image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
"이슈어값""해당일자"
"이슈어값"1.0000.000
"해당일자"0.0001.000
2023-12-10T15:31:53.958562image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
"기본키값""차례값""건수값""해당일자""이슈어값"
"기본키값"1.0000.044-0.0460.9900.000
"차례값"0.0441.000-0.9900.0000.849
"건수값"-0.046-0.9901.0000.0000.477
"해당일자"0.9900.0000.0001.0000.000
"이슈어값"0.0000.8490.4770.0001.000

Missing values

2023-12-10T15:31:51.251673image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
A simple visualization of nullity by column.
2023-12-10T15:31:51.441406image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

"기본키값""채널값""해당일자""차례값""이슈어값""건수값"
015898"블로그"2020-05-011"어린이"5704
115899"블로그"2020-05-012"초등학생"2349
215900"블로그"2020-05-013"청소년"1922
315901"블로그"2020-05-014"20대(20~29세)"1631
415902"블로그"2020-05-015"고등학생"1516
515903"블로그"2020-05-016"10대(10~19세)"1177
615904"블로그"2020-05-017"신생아"1148
715905"블로그"2020-05-018"중학생"1138
815906"블로그"2020-05-019"30대(30~39세)"704
915907"블로그"2020-05-0110"100세"620
"기본키값""채널값""해당일자""차례값""이슈어값""건수값"
390279144"블로그"2020-05-1724"8090세대"3
391279145"블로그"2020-05-1725"6070세대"1
392296700"블로그"2020-05-181"어린이"5031
393296701"블로그"2020-05-182"초등학생"2370
394296702"블로그"2020-05-183"청소년"2220
395296703"블로그"2020-05-184"20대(20~29세)"2109
396296704"블로그"2020-05-185"고등학생"1847
397296705"블로그"2020-05-186"중학생"1213
398296706"블로그"2020-05-187"10대(10~19세)"1133
399296707"블로그"2020-05-188"신생아"1037